INTELLIGENCE AND FAMILY SIZE 


ANNE ANASTASI 
Fordham University 


The past twenty-five years have 
witnessed a growing interest on the 
part of psychologists, geneticists, 
and demographers in the relation- 
ship between intellectual level and 
family size. As a result of the nega- 
tive correlations commonly found be- 
tween intelligence test scores of chil- 
dren and number of siblings, several 
writers have predicted a drop in the 
intellectual level of the population. 
Data which have recently become 
available from a variety of sources, 
however, raise serious doubts regard- 
ing such a conclusion. Careful an- 
alyses of the problem, moreover, have 
revealed that it is far more complex 
—both methodologically and theoret- 
ically—than was originally supposed. 


THE GENERAL QUESTION OF 
DIFFERENTIAL FERTILITY 

In current English demographic 
usage, the term “fecundity” signifies 
capacity to produce living offspring, 
whereas “fertility” refers to actual 
performance (60). Some investigators 
have differentiated further 
among number of pregnancies, num- 
ber of births (including stillbirths), 
number of children born alive, and 
number of children alive at the time 
of the survey (71). For practical reasons, 
however, most studies have 
been concerned only with the last-named 
category. To be precise, one 
should also identify foster children 
and stepchildren within each family, 
but not all studies have done so. The 
number of such cases, of course, is 
relatively small and would not ma- 
terially affect the conclusions. 

Mention should also be made of 
the frequently reported “crude birth 
rate,” which is simply the number 
of births during a year, per thousand 
population. Such a figure reflects 
not only mean size of family but also 
other demographic characteristics, 
such as the age distribution, sex 
ratio, and marriage habits of the 
population. Adjusted birth rates are 
sometimes computed, in which one 
or more of these characteristics are 
controlled. On the other hand, when 
average size of sibship is considered, 
as in most studies on fertility and 
intelligence, no account is taken of 
childless families, proportion of un- 
married persons, and differential mor- 
tality rates. It is thus apparent that 
fertility statistics can be variously 
expressed and that their specific im- 
plications need to be carefully scru- 
tinized. 
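
The two kinds of index distinguished above can be illustrated with a short computation. The following is only a sketch: the function names and all figures are hypothetical and are not drawn from any of the surveys cited. It shows the crude birth rate as defined in the text, and how the mean family size shifts when childless families are left out of account.

```python
def crude_birth_rate(births_in_year: int, population: int) -> float:
    """Number of births during a year, per thousand population."""
    if population <= 0:
        raise ValueError("population must be positive")
    return 1000.0 * births_in_year / population


def mean_family_size(children_per_family: list[int], include_childless: bool) -> float:
    """Mean number of children per family, with or without childless families."""
    sizes = [n for n in children_per_family if include_childless or n > 0]
    if not sizes:
        raise ValueError("no families to average over")
    return sum(sizes) / len(sizes)


# Hypothetical figures, chosen only to make the arithmetic easy to follow.
rate = crude_birth_rate(births_in_year=2_400, population=100_000)

families = [0, 0, 1, 2, 3, 6]  # children per family, including two childless families
all_families = mean_family_size(families, include_childless=True)
with_children_only = mean_family_size(families, include_childless=False)

print(f"crude birth rate: {rate:.1f} per thousand")
print(f"mean size, all families: {all_families:.1f}")
print(f"mean size, families with children only: {with_children_only:.1f}")
```

In this invented example the mean rises from 2.0 to 3.0 children once the childless families are dropped, which is the kind of discrepancy that makes the specific form of a fertility statistic worth scrutinizing.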

Most demographers agree that, al- 
though capacity for procreation may 
be influenced by genetic factors and 
by such environmental conditions as 
nutrition and climate, variations in 
fertility among existing human popu- 
lations arise chiefly from social and 
psychological factors (60, 96). Simi- 
lar explanations have been offered for 
progressive changes in fertility within 
given populations. In the United 
States and in most European countries, 
the birth rate declined steadily 
from the early or late nineteenth 
century to the 1930's, reaching its 
lowest point during or immediately 
following the economic depression. 
This decline has usually been attrib- 
uted to socioeconomic factors, such 
as urban migration, the rising stand- 
ard of living, and the increasing ex- 
pense of rearing children (96). Fol- 
lowing World War II, the trend 
was reversed, the crude birth rate for 
these countries rising markedly and 
reaching a peak in the late 1940's. 
Much of this rise is believed to be 
due to the abnormally large number 
of marriages during and immediately 
after World War II, as well as to 
births which had been postponed 
during the depression or early part 
of the war, or which were advanced 
from later years because of the economic 
prosperity (96). Some demographers, 
however, see evidence of a 
genuine increase in size of completed 
families (96). 

Differential fertility in various sub- 
divisions of a population was recorded 
as early as the seventeenth century 
in Europe (96). In a survey of British 
demographic writings appearing be- 
tween 1660 and 1760, for example, 
Kuczynski (52) found references to 
the greater fertility of rural as com- 
pared with urban dwellers, and of the 
poor as compared with the wealthy. 
The data upon which these early 
opinions were based are, of course, 
meager and difficult to interpret. In 
more recent times, however, a mass 
of data has accumulated which shows 
an inverse relationship between fam- 
ily size and such variables as income, 
occupational level, and amount of 
education (47, 61, 84, 96). The pre- 
viously mentioned decline in birth 
rate which began in the nineteenth 
century was more rapid in the upper 
than in the lower socioeconomic and 
educational classes, thus either producing 
or augmenting such fertility 
differentials. 

Certain other findings regarding 
fertility differentials are of special 
interest. One of the most persistent 
differentials, occurring even when 
other subgroup differences in family 
size are absent or reversed, is that 
between urban and rural groups. 
This difference usually increases when 
the comparison is between agricul- 
tural and nonagricultural occupa- 
tions. The largest families are thus 
found among those rural residents 
who are engaged in agriculture (96). 
Another suggestive finding, reported 
in an American survey, is that family 
size tended to be more closely related 
to the educational level of the wife 
than to that of the husband (5). 
Finally, it is noteworthy that a com- 
parison of native whites and Negroes 
in the United States reveals that the 
fertility rates of both groups are af- 
fected in the same way by such fac- 
tors as socioeconomic status, educa- 
tion, and urban-rural residence (53). 

All of the above socioeconomic 
and educational differences in fer- 
tility have been interpreted as in- 
direct evidence for a relationship be- 
tween fertility and intelligence. Inso- 
far as occupational and educational 
level are positively correlated with 
intelligence test scores, and urban 
scores average higher than rural, the 
inverse relation of these variables 
with family size suggests a similar 
inverse relation between intelligence 
and family size. Even prior to the 
direct study of the latter relation- 
ship, therefore, its implications had 
been considered by geneticists and 
demographers. 

There are, however, a number of 
noteworthy exceptions to the rela- 
tionships cited above. For example, 
several local studies in American, 
Canadian, French, German, Norwegian, 
and Swedish cities have 
shown that the negative correlation 
between income and family size pre- 
vails only up to a certain income level 
(96). Moreover, within the higher 
income groups, there is sometimes a 
tendency for family size to be posi- 
tively correlated with income. Simi- 
larly, exceptions can be found to the 
hypothesized relation between fer- 
tility and occupational level (30). 

Special mention should be made of 
the Indianapolis survey, conducted 
in 1941-42 (48, 49, 50, 51, 106, 107). 
This study began with a short ques- 
tionnaire administered to approxi- 
mately 41,500 native, white couples 
living in Indianapolis, in which the 
wife was under 45 years of age and 
neither spouse had previous marriages. 
The major data were gathered 
in an intensive interview survey of 
about 1,500 women selected from the 
larger sample because they met certain 
additional specifications. The 
investigation was designed to test a 
number of hypotheses regarding so- 
cial and psychological factors affect- 
ing fertility. Many of the findings 
are difficult to interpret, however, 
because of the complex interrelationships 
and interactions of the variables 
concerned (cf. 48, 51). Nevertheless, 
one observation is of special 
interest in the present connection. 
Although the usual inverse relation- 
ship between socioeconomic indices 
and fertility was found over a large 
part of the range, this relationship 
changed from negative to positive 
within the highest income levels. 
Such a result held in both the total 
sample and the subsample chosen for 
intensive study. It was likewise 
found to be true in another subsam- 
ple including only wives aged 40 to 
44 years, who were more likely to 
have completed families at the time 
of the survey (50). 

One of the hypotheses proposed to 
account for such results is based upon 
the relation of upward social mobility 
to family size (104). There is some 
direct evidence in support of this 
hypothesis. Baltzell (6), for example, 
found larger families among residents 
of Philadelphia listed in both the 
1940 Who's Who and the Social Reg- 
ister than among those listed only in 
Who's Who. He argued that the lat- 
ter had more often achieved their 
socially prominent position through 
their own efforts rather than having 
occupied it throughout life. They 
had thus shown more upward mo- 
bility than the Social Register group. 
Other analyses conducted within the 
same study, involving persons with 
a private school education and those 
whose families had been listed in 
Who's Who since 1900, likewise indicated 
greater fertility among the socially 
less mobile groups. Attempts 
to test the hypothesis among intermediate 
socioeconomic groups have 
yielded inconsistent results. At such 
levels, it is difficult to find groups in 
which the drive for upward social 
mobility can be assumed to be negligible. 
For example, persons whose occupational 
levels are higher than 
those of their fathers or who have advanced 
in occupational level within 
their own vocational history have 
been compared with those who failed 
to show such changes (7). Even the 
“control” group which has manifested 
no occupational changes of 
this type, however, may have manifested 
upward mobility in other ways. 

It is apparent from various sources 
that, even prior to the postwar “baby 
boom,” generalizations regarding the 
inverse relationship of fertility to 
socioeconomic factors needed qualification. 
Following World War II, 
fertility increased relatively more 
among those groups in which the prewar 
level had been low, i.e., among 
urban residents and persons in higher 
educational, occupational, and income 
levels. Thus the increasing 
birth rate has had the effect of nar- 
rowing fertility differentials within 
the population. These changes are 
vividly illustrated by some of the 
data gathered in recent sample sur- 
veys by the United States Bureau of 
the Census (98, 99). For example, 
when married women between the 
ages of 15 and 44 were considered 
in the 1952 survey, no relationship 
was found between income level and 
total number of children, except for 
a proportionately large number of 
children in the lowest income group 
(under $1,000). When the same 
women were classified according to 
husband’s occupation, some reversals 
in the usual fertility differentials 
were observed. Thus, professional, 
semiprofessional, and managerial 
groups had proportionately more 
children than did certain lower occu- 
pational groups, such as clerical and 
sales workers.¹ Fertility differentials 
among educational groups have also 
decreased (98). The remaining differences 
between educational classes 
are still large, however, as are those 
between urban and rural groups. 

A similar narrowing of fertility dif- 
ferentials has been reported for a 
number of European countries (cf. 
16, 96). For example, there is evi- 
dence of an actual increase in family 
size in the upper social levels of Swe- 
den and Great Britain. Moreover, 
the decrease in fertility differentials 
holds for comparisons between countries 
as well. Thus it is largely in the 
low-fertility countries that the birth 
rate has increased. 

¹ The fact that significant differences in 
family size were obtained among occupational 
groups, while no such differences were found 
among income levels within the corresponding 
range, reflects the low correlation between income 
and occupational level. The range of 
income within each of the major occupational 
categories is very wide, and the income differential 
between such occupational groups has 
been declining for many years. 

Mention may also be made of con- 
ditions prevailing within high-fer- 
tility areas. These are identified as 
regions having a crude birth rate in 
the vicinity of 40 per thousand, or 
higher (96). Such birth rates have 
been reported among peoples of 
Africa, Asia, and Latin America; 
among non-European peoples of 
Oceania; and in certain parts of Eu- 
rope. For most of these groups, demo- 
graphic data are very inadequate. 
In general, the birth rate in these 
areas has shown no upward or down- 
ward trend, unlike the birth rate in 
low-fertility areas. Moreover, the 
usual fertility differentials have not 
been observed in high-fertility areas. 
In Brazil, for example, the urban- 
rural differential is the only one which 
follows the familiar pattern. Reported 
socioeconomic differentials in 
that country are slight and not in the 
expected direction. Thus within each 
type of work, the fertility of employ- 
ers and own-account workers is gen- 
erally higher than that of employees 
and unpaid family workers (96). The 
data from high-fertility areas, how- 
ever meager, provide another major 
exception to the negative correlation 
between socioeconomic level and 
family size. 


STUDIES OF INTELLIGENCE TEST 
PERFORMANCE IN RELATION 
TO FAMILY SIZE 


While much has been inferred 
about the relationship between fer- 
tility and intelligence from studies on 
socioeconomic factors, a direct ap- 
proach to this problem has also been 
utilized. Current concern with this 
question on the part of demographers 
is illustrated by the inclusion of a session 
on “methods of research on relations 
between intelligence and fertility” 
in the World Population Conference 
held under United Nations 
auspices in Rome in 1954 (97). Similarly, 
a special working group was 
called together by Unesco to dis- 
cuss the implications of fertility dif- 
ferentials for the intellectual develop- 
ment of the population. This group 
met in Paris early in 1954 in order to 
draw up a statement which was con- 
tributed by Unesco to the World 
Population Conference (17). 

Investigations on the relationship 
between family size and intelligence 
test score have been conducted in 
several countries, although the liveli- 
est interest in this problem has been 
manifested in Great Britain. Among 
such studies, the outstanding exam- 
ples are the Scottish surveys (81, 82, 
83). In each of these surveys, an ef- 
fort was made to test a complete age 
sample. In 1932 and again in 1947, a 
45-minute group intelligence test 
including verbal and pictorial material 
was administered to all 11-year-old 
children in Scotland. The samples 
actually tested consisted of 
87,498 and 70,805 children in the 
first and second surveys, respectively. 
These samples are described as com- 
plete except for the children whose 
sensory or motor handicaps precluded 
the valid administration of this test, 
those school children who were absent 
on the day of testing, and a few chil- 
dren attending certain private schools 
for whom the necessary background 
data could not be obtained. In terms 
of the total population of 11-year-old 
Scottish children, the two samples af- 
forded approximately the same cov- 
erage. Since the estimated number 
of 11-year-old children in Scotland 
was 100,300 in 1932 and 80,300 in 
1947, the percentages covered in the 
two samples were 87 and 88, respec- 
tively. 
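
The coverage percentages quoted above follow directly from the reported counts. The brief check below uses only the figures given in the text; the function and variable names are illustrative.

```python
def coverage_percent(tested: int, estimated_population: int) -> float:
    """Percentage of the estimated age group that was actually tested."""
    return 100.0 * tested / estimated_population


# (children tested, estimated number of 11-year-olds in Scotland), as reported in the text.
scottish_surveys = {
    1932: (87_498, 100_300),
    1947: (70_805, 80_300),
}

for year, (tested, estimated) in scottish_surveys.items():
    pct = coverage_percent(tested, estimated)
    print(f"{year}: {pct:.1f} per cent, rounded to {round(pct)}")
```

Rounded to the nearest whole number, the two values are 87 and 88, in agreement with the figures stated.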

Surveys with individual tests were 
likewise conducted on more narrowly 
defined samples. All children born in 
Scotland on the first day of February, 
May, August, or November, 1926 
were given the 1916 Stanford-Binet 
and a series of eight performance 
tests (63). This group comprised 
874 cases, tested over a three-year 
period at ages ranging from 8-11 to 
11-9. In connection with the 1947 
group test survey, 1,215 children were 
also tested with Form L of the 1937 
Stanford-Binet, as modified for Scot- 
tish children. This group, known as 
the “six-day sample,” included children 
born in Scotland on the first 
day of the even months in 1936. The 
coverage of the individual test sur- 
veys is more nearly complete than 
that of the group test surveys, special 
efforts having been made to reach 
every child who met the birthday 
specifications. Moreover, the 1,215 
children constituting the “six-day 
sample” of the 1947 survey were studied 
more intensively in terms of 
background characteristics. The 
same children are currently being followed 
up to determine the effect of 
family size and other factors upon 
their subsequent development (66). 
In the earlier Scottish surveys, no 
data on family size were obtained, al- 
though the results of these surveys 
were employed in the follow-up stud- 
ies to be discussed in a later section 
of this paper. In the 1947 survey, the 
group test² yielded a correlation of 
−.28 with size of sibship, and the 
Stanford-Binet a correlation of −.32. 
Comparisons of mean scores likewise 
showed a consistent and significant 
drop with increase in family size. 
Thus the mean Stanford-Binet IQ 
dropped steadily from 113 for “only” 
children to 91 for children with five 
sibs (82). On the group test, the 
mean decrease in score per unit 
increase in family size was .13σ (83). 

² Only the verbal part of the group test 
was included in this analysis. The results obtained 
with the pictorial part proved to be too 
highly skewed because of low test ceiling; 
hence they have not been analyzed further. 

The tendency for children from 
larger families to score lower per- 
sisted within occupational classes 
(83), a finding which has also been 
reported by other investigators (14, 
21). All such studies, however, have 
employed fairly broad occupational 
categories. Thus it cannot be as- 
sumed that socioeconomic differences 
have been ruled out when compari- 
sons are made within single occupa- 
tional levels. It is also interesting to 
note that, in the highest occupational 
levels, the negative correlation be- 
tween intelligence and family size 
may disappear or be replaced by a 
positive correlation. Cattell (21) and 
Moshinsky (68) report that the high- 
est negative correlations are found 
in occupations of intermediate social 
status. In this connection, reference 
may again be made to the 
factor of upward social mobility, discussed 
in the preceding section. It 
is at the intermediate social levels 
that such a factor would be most 
likely to operate. 

In the 1947 Scottish survey, group 
test scores were available for 974 
twins (83). Analysis of these scores 
corroborated the usual finding that 
twins score lower than singletons, 
the difference in this group corre- 
sponding to approximately 5 IQ 
points. It was further demonstrated 
that such a difference could not be 
attributed to family size or to socio- 
economic level. There was no tend- 
ency for the twins to come from 
larger families, when number of 
births rather than number of children 
was considered. Such socioeconomic 
indices as relative size of 
home and paternal occupation indi- 
cated no inferiority of the twin sam- 
ple. In fact, there was an excess of 
twins in the professional and employer 
groups, and a deficit in the 
unskilled labor group. The implications 
of these findings will be considered 
below, after other pertinent 
data have been examined. 

Another extensive survey is that 
conducted among 6- to 12-year-old 
French school children in 1943-44 
(36, 38). This survey included 
95,237 boys and girls, approximately 
a 2 per cent sample of the elementary 
school population of France. Care 
was taken to obtain a representative 
distribution of cases over the twenty 
regions of France. All subjects were 
given René Gille’s “test Mosaique,”³ 
a specially constructed pictorial group 
test of intelligence. When analyzed 
in respect to family size, the mean 
test scores exhibited a consistent 
drop with increasing size of sibship. 
This differential was apparent within 
each year group, the difference in 
mean score between a one-child 
family and a family of eight or more 
being equivalent to 1 or 2 years of 
mental age. Within occupational 
groups, the negative relationship be- 
tween intelligence and family size 
was clearly apparent among farmers, 
manual laborers, and clerical work- 
ers; it was barely discernible in the 
managerial group and negligible in 
the professional class. 

³ This test is described and completely 
reproduced in the first report of the French 
survey (38, pp. 75-96). Data on reliability 
and validity are included in the second report 
(36, pp. 21-45). It was called the “mosaic” 
test because it includes a wide variety of 
items. 

A special study was concerned 
with the length of intersibling interval 
(36, Ch. 3; 90). Within the total 
sample covered by the French survey, 
there were 1,244 two-sibling 
families in which both siblings had 
been tested. These were separated 
into “long interval” and “short interval” 
sibships, the latter being 
defined as those falling at or below 
the median interval. In the urban 
group the median interval was 24 
months, in the rural 23 months. On 
the intelligence test, the children 
with long intersib intervals obtained 
significantly higher means, these 
differences persisting within each of 
the five occupational categories into 
which the sample was subdivided. 
With long intersib intervals, the 
scores approximated those of only 
children; with short intersib intervals, 
they approximated the scores 
obtained by three-child sibships. 
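
The classification rule described above, in which a sibship counts as “short interval” when its interval falls at or below the median, may be sketched as follows. The interval values are invented for illustration and the function name is hypothetical; the French reports may have handled ties or grouping differently.

```python
from statistics import median


def split_by_median_interval(intervals_months: list[float]):
    """Split sibships into short- and long-interval groups at the median.

    Intervals at or below the median count as "short", as in the text.
    """
    if not intervals_months:
        raise ValueError("at least one interval is required")
    cut = median(intervals_months)
    short = [m for m in intervals_months if m <= cut]
    long_ = [m for m in intervals_months if m > cut]
    return cut, short, long_


# Hypothetical intervals (in months) between the two children of six families.
example_intervals = [12, 18, 24, 24, 30, 48]
cut, short, long_ = split_by_median_interval(example_intervals)

print(f"median interval: {cut} months")
print(f"short-interval sibships: {short}")
print(f"long-interval sibships: {long_}")
```

Because ties at the median are assigned to the short-interval group, the two groups need not be equal in size.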

Test scores were also available for 
750 twins (36, Ch. 3; 90). In this 
survey, family size averaged higher 
for twins than for the total sample, 
even when the twins were counted 
as a single birth. Analysis of paternal 
occupation showed a greater propor- 
tion of farmers in the twin sample 
than in the total sample. With regard 
to all other occupational classes, 
however, the twin sample was superior 
to the total sample. Thus a 
higher percentage of twins had fathers 
in the managerial and professional 
classes, and a lower percentage 
in the clerical and manual labor cate- 
gories than was the case in the total 
sample. On the whole, then, the find- 
ings pertaining to occupation are 
similar to those of the Scottish sur- 
vey, in which paternal occupation 
was somewhat superior for twins 
than for singletons. 

The test scores of the twins were 
again inferior to those of the total 
sample. This difference was found 
at each age, and remained when 
family size and occupational level 
were controlled. In the case of fami- 
lies of four or more children, how- 
ever, the twin inferiority was negligi- 
ble and inconsistent, being most pro- 
nounced in the case of two- and three- 
sib families. Moreover, among the 
two-sib families, the twin scores were 
very similar to those of short-interval 
sibs, as defined in the earlier part of 
the study. 

Other similar surveys have been 
conducted on a less extensive scale 
in several countries. In England, 
there have been a number of at- 
tempts to test complete age samples 
of children within restricted areas. 
Fraser Roberts and his co-workers 
(77, 79) administered the Otis test 
to all children born during a four-year 
period and living within the 
city of Bath; a subsample was also 
individually tested with the Stan- 
ford-Binet. The Cattell Culture-Free 
Test was used by Cattell (18, 19) 
with all 10-year-old children in Lei- 
cester, a typical industrial city, and 
Devonshire, an “unspoilt rural area.” 
Sutherland and Thomson (89) ap- 
plied a specially developed group in- 
telligence test to all 11-year-old 
school children in the Isle of Wight. 
Burt (14) reports the results obtained 
with the elementary school popula- 
tion of one of the London boroughs 
in the course of his standardization 
of the Binet scale. The Otis test 
scores of 393 10-year-olds in Sheffield 
were analyzed by Bradford (12). All 
of these studies confirm the findings 
of the more extensive surveys regard- 
ing the decline in mean intelligence 
test score with increasing size of sib- 
ship. When correlations were com- 
puted between test score and family 
size, the coefficients were uniformly 
negative and clustered in the .20’s 
and low .30’s. 
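
For readers unfamiliar with the statistic, the coefficients referred to above are product-moment correlations between test score and size of sibship. The sketch below computes such a coefficient on invented data; the scores and sibship sizes are hypothetical and do not reproduce any of the surveys cited, and the resulting value is far stronger than the coefficients actually reported.

```python
from math import sqrt


def pearson_r(xs: list[float], ys: list[float]) -> float:
    """Product-moment correlation between two equally long lists of values."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length with at least two values")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when one variable is constant")
    return sxy / sqrt(sxx * syy)


# Hypothetical data: one child per row, with sibship size and test score.
sibship_size = [1, 2, 3, 4, 5, 6]
test_score = [110, 104, 101, 99, 94, 92]

r = pearson_r(sibship_size, test_score)
print(f"correlation between sibship size and score: {r:.2f}")
```

A negative value indicates only that scores tend to fall as sibship size rises in the sample at hand; it says nothing by itself about the cause of the association.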

A similar inverse relation between 
intelligence and family size is re- 
ported by Papavassiliou (72), who 
gave a Greek adaptation of the Stan- 
ford-Binet to 349 children in Athens. 
Heinen (37) found a tendency for 
family size to be inversely related to 
school grades and other indices of 
academic ability in several large 
samples of German school children. 
In another survey of German school 
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mean decrease in score per unit-in- 
crease in family size was .13¢0 (83). 

The tendency for children from 
larger families to score lower per- 
sisted within occupational classes 
(83), a finding which has also been 
reported by other investigators (14, 
21). All such studies, however, have 
employed fairly broad occupational 
categories. Thus it cannot be as- 
sumed that socioeconomic differences 
have been ruled out when compari- 
sons are made within single occupa- 
tional levels. It is also interesting to 
note that, in the highest occupational 
levels, the negative correlation be- 
tween intelligence and family size 
may disappear or be replaced by a 
positive correlation. Cattell (21) and 
Moshinsky (68) report that the high- 
est negative correlations are found 
in occupations of intermediate so- 
cial status. In this connection, ref- 


erence may again be made to the 
factor of upward social mobility, dis- 
cussed in the preceding section. It 


is at the intermediate social levels 
that such a factor would be most 
likely to operate. 

In the 1947 Scottish survey, group 
test scores were available for 974 
twins (83). Analysis of these scores 
corroborated the usual finding that 
twins score lower than singletons, 
the difference in this group corre- 
sponding to approximately 5 IQ 
points. It was further demonstrated 
that such a difference could not be 
attributed to family size or to socio- 
economic level. There was no tend- 
ency for the twins to come from 
larger families, when number of 
births rather than number of chil- 
dren was considered. Such socio- 
economic indices as relative size of 
home and paternal occupation indi- 
cated no inferiority of the twin sam- 
ple. In fact, there was an excess of 
twins in the professional and em- 
plover groups, and a deficit in the 


mental 
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unskilled labor group. The implica- 
tions of these findings will be con- 
sidered below, after other pertinent 
data have been examined. 

Another extensive survey is that 
conducted among 6- to 12-year-old 
French school children in 1943-44 
(36, 38). This survey included 
95,237 boys and girls, approximately 
a 2 per cent sample of the elementary 
school population of France. Care 
was taken to obtain a representative 
distribution of cases over the twenty 
regions of France. All subjects were 
given René Gille’s “test Mosaique,” 
a specially constructed pictorial group 
test of intelligence. When analyzed 
in respect to family size, the mean 
test scores exhibited a consistent 
drop with increasing size of sibship. 
This differential was apparent within 
each year group, the difference in 
mean score between a_ one-child 
family and a family of eight or more 
being equivalent to 1 or 2 years of 
age. Within occupational 
groups, the negative relationship be- 
tween intelligence and family size 
was clearly apparent among farmers, 
manual laborers, and clerical work- 
ers; it was barely discernible in the 
managerial group and negligible in 
the professional class. 

A special study was concerned 
with the length of intersibling inter- 
val (36, Ch. 3; 90). Within the total 
sample covered by the French sur- 
vey, there were 1,244 two-sibling 
families in which both siblings had 
been tested. These were separated 
into “long interval’ and “short in- 
ternal” sibships, the latter being 
defined as those falling at or below 


3 This test is described and completely 
reproduced in the first report of the French 
survey (38, pp. 75-96). Data on reliability 
and validity are included in the second report 
(36, pp. 21-45). It was called the ‘‘mosaic”’ 
test because it includes a wide variety of 
items. 
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the median interval. In the urban 
group the median interval was 24 
months, in the rural 23 months. On 
the intelligence test, the children 
with long intersib intervals obtained 
significantly higher means, these 
differences persisting within each of 
the five occupational categories into 
which the sample was. subdivided. 
With long intersib intervals, the 
scores approximated those of only 
children; with short intersib inter- 
vals, they approximated the scores 
obtained by three-child — sibships. 

Test scores were also available for 
750 twins (36, Ch. 3; 90). In this 
survey, family size averaged higher 
for twins than for the total sample, 
even when the twins were counted 
asa single birth. Analysis of paternal 
occupation showed a greater propor- 
tion of farmers in the twin sample 
than in the total sample. With re- 
gard to all other occupational classes, 
however, the twin sample was supe- 
rior to the total sample. Thus a 
higher percentage of twins had fa- 
thers in the managerial and _ profes- 
sional classes, and a lower percentage 
in the clerical and manual labor cate- 
gories than was the case in the total 
sample. On the whole, then, the find- 
ings pertaining to occupation are 
similar to those of the Scottish sur- 
vey, in which paternal occupation 
was somewhat superior for 
than for singletons. 

The test scores of the twins were 
again inferior to those of the total 
sample. This difference was found 
at each age, and remained when 
family size and occupational level 
were controlled. In the case of fami- 
lies of four or more children, how- 
ever, the twin inferiority was negligi- 
ble and inconsistent, being most pro- 
nounced in the case of two- and three- 
sib families. Moreover, among the 
two-sib families, the twin scores were 
very similar to those of short-interval 


twins 
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sibs, as defined in the earlier part of 
the study. 

Other similar surveys have been 
conducted on a less extensive scale 
in several countries. In England, 
there have been a number of at- 
tempts to test complete age samples 
of children within restricted areas. 
Fraser Roberts and his co-workers 
(77, 79) administered the Otis test 
to all children born during a four- 
vear period and living within the 
city of Bath; a subsample was also 
individually tested with the Stan- 
ford-Binet. The Cattell Culture-Free 
Test was used by Cattell (18, 19) 
with all 10-year-old children in Lei- 
cester, a typical industrial city, and 
Devonshire, an “unspoilt rural area.”
Sutherland and Thomson (89) applied
a specially developed group
intelligence test to all 11-year-old
school children in the Isle of Wight.
Burt (14) reports the results obtained 
with the elementary school population
of one of the London boroughs
in the course of his standardization
of the Binet scale. The Otis test 
scores of 393 10-year-olds in Sheffield 
were analyzed by Bradford (12). All 
of these studies confirm the findings 
of the more extensive surveys regard- 
ing the decline in mean intelligence 
test score with increasing size of sib- 
ship. When correlations were com- 
puted between test score and family 
size, the coefficients were uniformly 
negative and clustered in the .20’s 
and low .30’s. 

A similar inverse relation between 
intelligence and family size is reported
by Papavassiliou (72), who
gave a Greek adaptation of the Stan- 
ford-Binet to 349 children in Athens. 
Heinen (37) found a tendency for 
family size to be inversely related to 
school grades and other indices of 
academic ability in several large 
samples of German school children. 
In another survey of German school
children, Eydt (34) reported more 
retardation and behavior problems 
among children from large families. 

Several American studies have 
yielded similar results. Lentz (54) 
obtained a correlation of -.304 between
family size and intelligence
test scores among 4,330 urban school 
children in five states. Within sepa- 
rate cities, however, the correlations 
varied from —.065 to —.411. This 
finding is not surprising in view of 
the fact that both the tests and the 
examiners varied from school to 
school. Chapman and Wiggins (24) 
reported a correlation of —.33 be- 
tween National Intelligence Test 
scores and size of family among 650 
school children. When the correla- 
tion was recomputed for children 
with native-born and with foreign- 
born parents, it fell to —.22 and 
-.24, respectively. The authors
point out that most of the children
came from completed families, since 
the youngest child in any sibship in- 
cluded in the study was 5 years old. 


In a group of 1,140 8th grade 
school children in Oakland, Cali- 
fornia, Burks and Jones (13) found a 
correlation of —.22 between Terman 
Group Test and total number of sib- 
lings (including deceased siblings). 
This correlation rose to —.31 when 
corrected for restricted range in both 
IQ and sibship size. In order to include
only completed families, the
analysis was limited to sibships in 
which the youngest child was at 
least 9 years old. A correlation of 
—.31 between family size and Ter- 
man-McNemar intelligence test 
scores was obtained by Damrin (26) 
in a group of 156 high school girls. 

Hirsch (40) found a consistent de- 
crease in mean Otis IQ with increas- 
ing family size in a group of 214 Ten- 
nessee school children. The mean 
dropped from 115.8 for one-child 
families to 96.6 for eight-child families.
Among still larger families, the
mean varied irregularly, dropping to 
82.0 for twelve-child families. In a 
study of more than 20,000 retarded 
children in the Massachusetts public 
schools, Dayton (27) noted a de- 
crease in mean IQ from 73.2 for one- 
child families to 68.9 for families of 
ten or more children. 

A more thorough analysis of the 
problem is provided in a study by 
Conrad and Jones (25), who tested 
representative samples of the rural 
population of three New England 
states. The children were examined 
with the Stanford-Binet, and the 
adults with the Army Alpha. Family 
size was investigated in relation to
intelligence test score of mother and 
father, education of mother and fa- 
ther, a social status index of the fam- 
ily, and mean intelligence test score 
of the sibship. In uncompleted fam- 
ilies, all of these measures correlated 
negatively with family size, although 
only the correlations with father’s 
intelligence and with social status 
reached statistical significance. In 
the completed families, on the other 
hand, all correlations were positive, 
but none was statistically significant. 
Further analyses showed that the 
families of lower intelligence began 
to have children at an earlier age, 
although their total child-bearing 
period was no longer than in the more 
intelligent families. These conclu- 
sions, of course, must be restricted to 
the particular population under in- 
vestigation. 

Similar results were obtained, how- 
ever, in a study of California fami- 
lies reported by Willoughby (108). 
On the basis of parental ages, the 
author concluded that most of the 
families were completed. Parental 
intelligence was determined from the 
combined score on six verbal and 
five nonverbal tests. Number of 
children correlated -.11 with the
test scores of 108 mothers and +.06
with the scores of 87 fathers, neither 
correlation being statistically signifi- 
cant. 

Another study in which intelligence
test scores of parents were employed
is that of Willoughby and
Coogan (109). Follow-up data were 
gathered on 373 persons who had 
graduated from high school 12 years
earlier and who had taken an intelli- 
gence test while still in school. No 
relationship was found between the 
original test scores and age at mar- 
riage, age at birth of first child, or 
number of offspring at the time of 
the survey. It is also interesting to 
note that within the same group a 
low but significant negative correla- 
tion was found between intelligence 
test score and the number of siblings 
of the subjects themselves. More- 
over, the subsequent education and 
the occupational level of the subjects 
revealed an inverse relation to num- 
ber of offspring. Although this study 
is inconclusive because of the small 
and selected sample and because of 
the incompleteness of the families, it 
nevertheless represents a promising 
approach to the problem.⁴ Its chief
merits lie in the fact that intelligence 
of parents was investigated directly 
and in the fact that such intelligence 
was measured prior to educational 
and vocational differentiation. 

An investigation currently in progress
provides further information on
the relationship between intelligence
of parents and number of offspring.⁵
The original data for this study were
gathered by E. Lowell Kelly as part
of a broader project involving the
testing, interviewing, and subsequent
20-year follow-up of 300 engaged
couples living in New England
(46). The data pertaining to fertility,
based on 216 couples, are being processed
by Charles F. Westoff and
Elliot G. Mishler of the Office of
Population Research, Princeton
University. At the time of the initial
testing, the mean age of the men was
26.6 years and that of the women
24.6, most of the subjects falling
within the age limits of 21 and 30.
Since data on number of offspring
were obtained 20 years later, it can
be assumed that most of the families
were completed. Educationally, the
group represented a superior sample,
more than half of the men and
approximately a third of the women
being college graduates at the time
of initial testing. The correlation
between initial Otis Intelligence Test
scores and subsequent number of
live births was .19 for the 216 men
and .17 for the 216 women, both
being significant at approximately the
.01 level. These correlations thus
suggest a slight tendency for the
brighter parents to have larger
families. It is noteworthy that none of
the studies in which parental
intelligence was correlated with number of
offspring has so far yielded the
negative correlation customarily found
when child intelligence is correlated
with size of sibship.

4 Cf. critique of this study by Fraser Roberts
(78), but note that one of the criticisms
is based upon a confusion between number of
siblings and number of offspring.

5 Personal communication from Dr. Charles
F. Westoff, November 4, 1955. Cf. also (67).

The relationship between fertility 
and intelligence has also been investi- 
gated within certain special groups. 
Moshinsky (68) analyzed the test 
scores of 10,159 English children en- 
rolled in different types of schools. 
In the relatively heterogeneous ele- 
mentary school group, a correlation 
of —.23 was found between intelli- 
gence and family size. Within more 
advanced, fee-paying schools, the 
correlations were negligible. Both 
Moshinsky and Blackburn (8) cite 
the latter finding in support of certain
hypotheses regarding the relationship
between intelligence test
performance and family size. As 
Burt (15) points out, however, such 
a result is to be expected because of 
the greater intellectual homogeneity 
of the children in the more advanced 
schools. 
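Burt's point about homogeneity can be illustrated numerically. The sketch below is not taken from any of the studies cited; it applies the standard formula for direct restriction of range on one variable, and the halving of the IQ standard deviation in the selected group is purely an assumed figure. Only the starting value of -.23 comes from the elementary school group described above.

```python
import math


def rescale_correlation(r: float, sd_ratio: float) -> float:
    """Correlation expected when the SD of one variable is multiplied by
    sd_ratio through direct selection on that variable.

    sd_ratio < 1 models a more homogeneous (selected) group;
    sd_ratio > 1 reverses the adjustment (correction for restriction).
    """
    return r * sd_ratio / math.sqrt(1.0 - r * r + r * r * sd_ratio * sd_ratio)


# -.23 is the elementary school correlation reported in the text.
# The SD ratio of 0.5 is a hypothetical value chosen for illustration.
full_r = -0.23
restricted_r = rescale_correlation(full_r, 0.5)

# Applying the inverse ratio recovers the unrestricted value.
recovered_r = rescale_correlation(restricted_r, 2.0)

print(round(restricted_r, 3))
print(round(recovered_r, 3))
```

Under these assumptions the correlation shrinks to roughly half its original size through selection alone, without any change in the underlying relationship.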

Hutton (42) reports an insignificant
correlation of -.05 between IQ
and number of siblings in the case of 
boys attending a British secondary 
grammar school. The group is de- 
scribed as falling within the upper 
half of the general population in in- 
telligence. A questionnaire survey of 
alumni of the same school indicated 
that the highly selected scholarship 
holders as a group tended to have 
a smaller number of offspring than 
the general population of Great 
Britain. But among the more recent
graduates the reverse was true, the
scholars showing a higher replace- 
ment rate than the generality. 

A number of studies on college
students, in both Great Britain and
the United States, have yielded very 
low and usually insignificant correla- 
tions between intelligence test scores 
and number of siblings (39, 64, 65, 85, 
102, 103). This lack of correlation is 
understandable in the light of the 
highly selected nature of such sam- 
ples. Not only are college students 
selected intellectually; they are also 
selected with respect to the constella- 
tion of psychological and socioeco- 
nomic factors which determine who 
goes to college. Some of these factors 
may themselves be directly related 
to family size. 

In their study of California public 
school children with Stanford-Binet 
IQ’s of 140 or higher, Terman and
his co-workers found a correlation of 
—.271 between IQ and size of sibship 
within 91 completed families (91). 
Through the subsequent follow-ups 
of the entire gifted group, additional
data have been gathered on the completed
family size of nearly all of
the original cases. These data, how- 
ever, have not yet been analyzed. 
The forthcoming fifth volume of 
Genetic Studies of Genius will contain 
extensive data on the relation be- 
tween intelligence and number of 
offspring of the gifted subjects themselves.⁶

In connection with a slum clear- 
ance survey, Dawson (28) adminis- 
tered the Stanford-Binet to 1,239 
Glasgow children, aged 3 to 14. Most 
of the children were tested just before 
their families moved to a new hous- 
ing development. The correlation 
between these IQ’s and the number
of living sibs was —.12; within a sub- 
group of 140 completed families, the 
correlation was —.10. The corre- 
sponding correlations with total num- 
ber of births (including stillbirths) 
were —.20 and —.30, respectively. 
O’Hanlon (71) reports the results of 
retesting 293 of these children eight 
years later. This group yielded a 
correlation of —.137 between IQ and 
number of living sibs, the correlation 
rising to —.291 in a subgroup of 28 
completed families. The correspond- 
ing correlations with total number of 
births were —.207 and —.413. The 
results of both Dawson and O’Han- 
lon illustrate the fact that differential 
fertility is partly offset by a differ- 
ential death rate during childhood. 

In an early study, Pearson and 
Moul (73) found no significant cor- 
relation between size of sibship and 
teachers’ ratings of intelligence 
among 1,202 Jewish school children 
in London. The children were largely 
of foreign-born parentage and came 
from relatively homogeneous and 
low socioeconomic levels. Fertility
was uniformly high in the group as
a whole. Sutherland (87) reports
correlations of -.129 and -.126
between IQ and family size in two
samples of children of British coal
miners totaling 3,096 cases. These
correlations are consistent with the
findings of other investigators insofar
as the relationship is somewhat
lower in the poorest socioeconomic
levels than in the intermediate levels.

6 Personal communication from Dr. Lewis
M. Terman, April 28, 1955.

In a study of 354 children in a 
Jewish orphanage in New York City, 
Locke and Goldstein (56) found a 
correlation of -.13 between Stanford-Binet
IQ and size of sibship.
When both mother’s age and order 
of birth were partialled out, this cor- 
relation rose to —.24. It is rather 
difficult, however, to interpret this 
partial correlation. For compara- 
bility with correlations obtained in 
other studies, moreover, the original 
correlation of —.13 would seem pref- 
erable. 

Sutherland (88) analyzed the IQ's 
of two samples of fatherless children. 
The first consisted of 123 12- to 14- 
year-old Glasgow school children 
whose fathers had died when the 
children were under one year of age.
The second sample comprised 724 
Yorkshire school children aged 11 to 
13 who were also fatherless but not 
necessarily from infancy. Each sample
was compared with a control
group of children, both of whose parents
were living. The mean IQ of the
fatherless children was lower than
that of control children with the same 
number of siblings. The correlations 
between IQ and family size were
significantly negative, but lower among
fatherless children than among the 
controls. The respective correlations 
were —.188 and —.26 in the Edin- 
burgh samples, and —.19 and —.23 
in the Yorkshire samples. 

In discussing these results, Sutherland
writes, “The tendency for the
brighter children to come from the
smaller families has been reduced by
the presence of small families whose
natural increase has been prevented
by death and whose children would
normally have had their place among
the larger families” (88, p. 168). A
similar explanation may account for 
the lower IQ's of the fatherless chil- 
dren within each sibship size. The 
test performance of such children 
resembled that of children in larger 
families, to which they would have 
belonged if family growth had not 
been cut off by the father’s death. 
This type of study represents one 
step toward the analysis of the com- 
plex factors determining the correla- 
tion between intelligence and family 
size. Its findings suggest that it is 
not size of sibship per se but other 
factors associated with family size 
within a given culture which produce 
the obtained differentials in intel- 
lectual level. 


FOLLOW-UP STUDIES WITH 
INTELLIGENCE TESTS 


On the basis of the negative correlation
between intelligence test
scores and family size, several writers 
predicted a gradual decline in the in- 
tellectual level of the population (14, 
18, 19, 20, 54, 77, 79, 92, 100). The 
estimated drop varied from about 2 
to 4 IQ points per generation. A
direct test of such a predicted decline 
is provided by follow-up surveys in 
which comparable samples have been 
tested under similar conditions after 
a lapse of several years. The most 
extensive and best controlled data on 
this question are to be found in the 
previously described Scottish surveys
(81, 82, 83, 93, 94). Rather
than showing the predicted decline, 
however, the scores revealed a small 
but significant improvement from 
the 1932 to the 1947 surveys. 

Other similar follow-ups have been 
reported by Burt (14), Cattell (23),
and Emmett (33). In comparison
to the Scottish surveys, these studies 
covered somewhat shorter intervals 
and used smaller samples of ques- 
tionable comparability. When test- 
ing is conducted within a limited area, 
such as a single county, city, or bor- 
ough, successive samples may differ 
owing to selective migration, dete- 
rioration of neighborhoods, slum clear- 
ance, changes in proportion of chil- 
dren institutionalized or attending 
private schools, and other similar 
factors. Moreover, differences in 
thoroughness of sampling procedures 
in successive surveys may introduce 
differential selection. For example, 
subjects in lower socioeconomic levels 
and those who are physically less fit 
are more likely to be excluded when 
sampling coverage is less adequate. 
Despite certain local variations in 
results, however, these studies also 
failed to substantiate the expected 
decline in test score. 

Reference may likewise be made
to certain relevant data gathered incidentally,
as by-products from other
types of testing programs. Thus an
analysis of the intelligence-test per- 
formance of American soldiers in 
World Wars I and II indicated that 
the level of performance had im- 
proved to such an extent that the 
median score of the later sample cor- 
responded to the 83rd percentile of 
the earlier sample (95). This rise in 
score paralleled an increase in amount 
of education, the mean being at the 
8th grade for the first sample and at 
the 10th grade for the second. It is 
also noteworthy that a survey of the 
intelligence test performance of 
American high school students over 
a 20-year period suggested that this, 
too, had improved, despite the
marked increase in the proportion of 
students enrolled in high school (35). 
Since a larger proportion of the total 
population was attending high school 
at the end than at the beginning of
this 20-year period, a decrease in
mean score would be expected, unless 
the total population had improved 
sufficiently to counteract such a drop. 
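To give a rough sense of the size of the soldier-sample gain reported above, a percentile standing can be converted to standard-deviation units. The sketch below assumes normally distributed scores with equal spread in both samples; that assumption is introduced here for illustration and is not stated in the source.

```python
from statistics import NormalDist


def percentile_to_sd_shift(percentile: float) -> float:
    """Distance, in SD units, between the median of one sample and the
    median of a reference sample, given the percentile of the reference
    sample at which the first median falls (normality assumed)."""
    return NormalDist().inv_cdf(percentile / 100.0)


# The later median fell at the 83rd percentile of the earlier sample.
shift = percentile_to_sd_shift(83)
print(round(shift, 2))
```

On this assumption the gain corresponds to a little under one standard deviation of the earlier distribution.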

When educational and other en- 
vironmental conditions within a given 
community improve conspicuously, 
larger rises in test scores may be ob- 
served over even shorter intervals. 
Thus a 10-year follow-up conducted 
in a relatively isolated mountainous 
region of eastern Tennessee showed 
a 10-point rise in median IQ (105). 
During this interval, the socioeco- 
nomic and educational conditions of 
this region had greatly improved. 
The second sample tested in this sur- 
vey was closely comparable to the 
first, the children coming largely from 
the same families in the two cases. 

Similar results are to be found in 
the previously cited slum survey conducted
by Dawson (29). A subgroup
of 289 4- to 9-year-old children who 
had been tested with the Stanford- 
Binet at the time of their transfer 
from the slum area was retested after 
12 to 18 months of residence in im- 
proved housing. A control group of 
56 cases who had not been transferred 
from the slum area was also retested 
over the same interval. The slum 
clearance group showed a significant 
mean rise of 1.5 IQ points, while the 
control group showed no significant 
change. To be sure, the mean IQ 
rise in the clearance group, though 
significant, was small. But it should 
be borne in mind that the retest was 
conducted over a relatively short 
period of time. Moreover, it is note- 
worthy that the improvement of the 
slum clearance group was also mani- 
fested on tests of arithmetic and 
reading, and persisted when com- 
parisons were made separately for 
boys and girls in all three variables. 
In the control group, on the other 
hand, about half of these compari- 
sons yielded positive changes and 
half negative changes. 







That intelligence test performance 
is susceptible to improvement as a 
result of education has been repeat- 
edly demonstrated by a variety of 
methods (4, Ch. 8). Of special inter- 
est are two investigations in which 
persons who had been tested follow- 
ing a uniform period of universal 
education were retested after 10 
years in one case and after 20 years 
in the other (41, 58). In the intervening
years, members of these groups
had received varying amounts of 
further education. Both studies re- 
port significant relationships between 
amount of subsequent schooling and 
retest performance, when initial scores 
are held constant. 

One of the hypotheses proposed to 
account for the rise in mean score
found in the Scottish surveys and
similar follow-ups is that of test 
sophistication. Thus it is argued 
that the subjects tested in the later 
samples had the advantage of greater 
familiarity with psychological tests. 
With regard to the Scottish surveys 
themselves, a special analysis of the 
scores obtained by children who had 
and those who had not been previ- 
ously examined with a similar test 
showed little effect of such test ex- 
perience (83, pp. 121-124). In a 
later and more extensive study of the 
same question, two subgroups within 
the 1947 Scottish sample were compared.
One was taken from Education
Authorities in which only 2.5%
of the children were reported to have
had previous test experience within
the year of the survey; the other included
Educational Authorities in
which 67% of the children had had
such experience. The mean rise in 
score from 1932 to 1947 in the two 
subgroups was 1.5 and 3.4 points, 
respectively, both being significant 
at the .01 level.⁷

7 Personal communication from Mr. James
Maxwell, University of Edinburgh, May 17,
1955.

Fundamentally, the question of
test sophistication and the related 
question of coaching must be an- 
swered in terms of breadth of im- 
provement (cf. 1; 2, pp. 52-56). Any 
influence which is restricted to the 
test performance itself and does not 
correspondingly affect the criterion 
behavior which the test is designed 
to predict would of course reduce the 
validity of the test. But the broad 
social and educational changes which 
have occurred in the time intervals 
under consideration can be expected 
to affect the individual’s over-all 
intellectual development, rather than 
being limited to the particular be- 
havior samples covered by specific 
tests. 

Carrying the argument a bit fur- 
ther, some writers have argued that 
“true, innate” intelligence may still
have declined, despite the rise in in- 
telligence test performance brought 
about by improved environmental 
conditions. Some have gone so far as 
to suggest that if test scores show 
improvement under such conditions, 
then the tests are at fault. This is 
analogous to arguing that the “true,
innate” height of the population has
declined as a result of the negative 
correlation between height and fam- 
ily size, and that the observed rise in 
mean height is illusory. Moreover, 
it could then be argued that we 
should devise a “culture-free” meterstick
to measure innate height freed
from environmental influences. Such
reasoning does not make scientific
sense. When the psychologist speaks
of intelligence, he refers to certain
properties of observable behavior.
Such behavior is by its very nature 
susceptible to environmental influ- 
ences. The same influences which 
bring about a rise in test score also 
cause an improvement in the quality 
of the individual’s general intellectual 
functioning (cf. 31, 75, 100). 

Penrose (74, 75, 76) has argued
that, even from a genetic point of
view, the negative correlation be- 
tween intelligence test scores and size 
of sibship need not imply a decline in 
the intellectual level of the popula- 
tion. Taking into account assortative 
mating and the infertility of low- 
grade mental defectives, he presents 
a simplified genetic model by which 
intellectual level remains constant 
despite the existing fertility differen- 
tials. Essentially, such a situation 
results from the fact that some par- 
ents of average or borderline intelli- 
gence do have above-average chil- 
dren. In a much earlier article by 
Willoughby and Goodrie (110), a 
similar argument is presented. Be- 
ginning with a correlation of —.17 
between intelligence and family size, 
the authors set up a genetic model 
based upon the combined contribu- 
tion of five gene pairs. They then 
trace the effects mathematically 
through five generations, using two 
marital coefficients, .37 and .58. With 
the former coefficient, a decline in 
mean intelligence results. With the 
latter, which is much closer to em- 
pirically obtained husband-wife cor- 
relations in intelligence test scores, 
there is actually a slight rise in mean 
intelligence. 


METHODOLOGICAL PROBLEMS 


Investigations on the relationship 
between family size and intelligence 
present a number of special meth- 
odological problems with regard to 
sampling and statistical analysis. A 
source of difficulty in several studies 
is the inclusion of incomplete families 
in the sample. This factor may oper- 
ate in a number of different and not 
entirely predictable ways. All chil- 
dren from such incomplete families 
are, of course, classified as coming 
from somewhat smaller families than 
will eventually be the case. This may 
have the effect of classifying lower 
test-scorers (from the ultimately
larger families) into a category of
higher test-scorers (from smaller fam- 
ilies). Such a procedure would thus 
reduce the obtained differences in 
test scores between the various fam- 
ily-size categories. On the other 
hand, when a negative correlation is 
found between intelligence and fam- 
ily size in a sample of incomplete 
families, it may simply reflect the 
tendency for persons of lower intelli- 
gence to begin having children ear- 
lier. Should such persons also stop 
having children earlier than the more 
intelligent members of the commu- 
nity, the correlation would disappear 
when completed families are studied. 
In the previously reported investiga- 
tion by Conrad and Jones (25) in 
rural New England communities, 
such was actually the case. 

It should be noted, of course, that 
if the less intelligent parents begin 
having children earlier than the more 
intelligent parents, they will con- 
tribute more individuals to the popu- 
lation in the long run, through more 
frequent generations. For example, 
if each successive generation begins 
to bear children at age 20, there will 
be five generations per century; while 
if child-bearing begins at age 25, 
there will be only four generations.
Nevertheless, the effect upon the 
intellectual level of the population 
would be less if the duller parents 
merely began to have children earlier 
than it would be if the duller also had 
larger completed families. 
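The arithmetic of this example can be written out as a short sketch. The function names and the figure of three children per couple are hypothetical, chosen only to show how an earlier start compounds even when completed family size is identical.

```python
def generations_per_century(age_at_childbearing: int) -> int:
    """Whole generations completed in 100 years when each generation
    begins child-bearing at the given age."""
    return 100 // age_at_childbearing


def lineage_multiplier(children_per_couple: float, age_at_childbearing: int) -> float:
    """Factor by which a lineage grows over a century, treating each
    couple's children as replacing two parents."""
    per_generation = children_per_couple / 2.0
    return per_generation ** generations_per_century(age_at_childbearing)


early = generations_per_century(20)  # five generations
late = generations_per_century(25)   # four generations

# Same completed family size (an assumed 3 children) in both groups.
early_growth = lineage_multiplier(3, 20)
late_growth = lineage_multiplier(3, 25)

print(early, late)
print(early_growth, late_growth)
```

With equal family sizes the difference comes entirely from the extra generation, which is why the effect is smaller than it would be if the earlier-starting group also had larger completed families.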

Age of parents is a factor which 
needs to be considered in its own 
right. The smaller families in a sam- 
ple may include a relatively large 
proportion of incomplete families of 
younger parents. Such parents are 
likely to be better educated than the 
older parents of the completed fami- 
lies, owing to the progressive rise in 
educational level of the population. 
The better educated parents would 
in turn provide a more favorable environment
for their offspring. Thus
when incomplete families are in- 
cluded in the sample, the differences 
in parental age, coupled with the ris- 
ing level of general education, may 
account at least in part for the nega- 
tive correlation found between fam- 
ily size and intelligence. 

A second major problem pertains 
to selective factors. Since the correla- 
tions in question are generally quite 
low, the operation of selective fac- 
tors, however slight, may produce a 
completely spurious result. An ex- 
ample of such a subtle selective factor 
is given in Shuey’s study of students 
in an American women’s college (85). 
Within the entire group of 2,261 entering
students for whom test scores
were available, a significant inverse
relation was found between test
scores and number of siblings. This
relationship disappeared, however,
when the records of those whose
older sisters had attended the same
college were excluded. Presumably
somewhat lower entrance requirements
had been applied to those
whose siblings had previously matriculated.
Such students would also
be likely to come from larger families.


Nor are more general samples free
from selective factors. Thus in a
sample of children between, let us
say, the ages of 8 and 14, individual
large families will appear repeatedly
within the sample, the number of reappearances
being directly proportional
to the size of the family. A
family containing five school-age
children, for example, would appear 
five times in the sample. Any chance 
conditions characterizing individual 
large families would thus be spuri- 
ously magnified in the sample. Nor 
would ordinary sampling statistics 
be strictly applicable to such samples. 
It is partly to avoid such difficulties 
that single-age samples, such as 11- 
year-olds, have been utilized. 

Even carefully chosen single-age
samples, however, are not entirely
free from selective bias. In such a 
sample, for example, large families 
will be overrepresented, since by 
chance a child of any specified age is 
more likely to be found in a large 
family than in a small one. Hence 
the sample is not representative of 
existing families, nor of the parental 
population. With regard to the child 
population, it can be argued that 
large families should be overrepre- 
sented, since they contribute more 
members to the population. Never- 
theless, certain types of comparisons 
may be distorted through the use of
a single-age sample. The reports of
the Scottish survey contain a discus- 
sion of the possible effects of such 
selective biases upon the analysis of 
birth order (82) and maternal age 
(83) in relation to child’s intelligence. 
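The overrepresentation of large families in a single-age sample can be made concrete with a small sketch. It assumes five hypothetical families of 1, 2, 3, 4, and 5 children and assumes that every child is equally likely to be of the sampled age, so that a family's chance of appearing is proportional to its size. The function names are illustrative only.

```python
from collections import Counter
from fractions import Fraction


def family_shares(sizes):
    """Share of families at each size, each family counted once."""
    counts = Counter(sizes)
    return {k: Fraction(n, len(sizes)) for k, n in sorted(counts.items())}


def child_sample_shares(sizes):
    """Expected share of a single-age child sample coming from each
    family size, with inclusion proportional to number of children."""
    counts = Counter(sizes)
    total_children = sum(sizes)
    return {k: Fraction(k * n, total_children) for k, n in sorted(counts.items())}


def mean_size(shares):
    """Mean family size under a given set of shares."""
    return sum(k * p for k, p in shares.items())


sizes = [1, 2, 3, 4, 5]  # hypothetical families
per_family = family_shares(sizes)
per_child = child_sample_shares(sizes)

print(per_family[5], per_child[5])
print(mean_size(per_family), mean_size(per_child))
```

In this illustration the five-child family makes up one fifth of the families but supplies one third of the children, so the mean family size seen through the child sample (11/3) exceeds the mean over families (3).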

Parenthetically, it may be added 
that studies on the relation of birth 
order to intellectual and other psychological
characteristics have frequently
yielded ambiguous and inconsistent
results because of the
failure to take family size into ac- 
count. Let us suppose that one group 
consists entirely of two-child families, 
while a second group contains only 
four-child families. Obviously, in the 
first group 50 per cent of the children 
will be first-born, while in the second 
there will be only 25 per cent. In the 
Scottish survey (82), the child’s posi- 
tion in his sibship was recorded as 
1₄, 2₅, etc., signifying the first-born in
a sibship of four, the second-born in 
a sibship of five, and so forth. Com- 
parisons were then made within each 
family size. Under these conditions, 
the first-born and last-born scored 
higher than the intermediate sibs. 
But even these results are inconclu- 
sive because of the previously men- 
tioned sampling biases. 

The computation of predicted intellectual
decline involves questionable
assumptions. The usual procedure
is based essentially upon the
comparison of two means obtained
from the same set of scores (cf.,
e.g., 14, 16). The mean of the subjects’
scores can of course be computed
directly. The mean of the
next generation is estimated by mul- 
tiplying the frequency in each class- 
interval of the IQ distribution by 
mean family size in that interval and 
using these new frequencies to find 
the mean. Similarly, the mean score 
of the parents of the present subjects 
is estimated by dividing the original 
frequencies by family size and mul- 
tiplying by two. 
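
The arithmetic of this procedure can be illustrated with a minimal sketch. The class-interval midpoints, frequencies, and mean family sizes used here are hypothetical values chosen only to show the computation, and the function names are illustrative; none of them is taken from the studies cited.

```python
def weighted_mean(values, weights):
    """Mean of `values`, each weighted by the matching entry in `weights`."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


def generation_means(midpoints, freqs, family_sizes):
    """Return (parents, present, next) mean scores under the usual procedure.

    midpoints    -- midpoint of each IQ class interval
    freqs        -- number of tested children in each interval
    family_sizes -- mean sibship size of the children in each interval
    """
    # Present generation: computed directly from the observed frequencies.
    present = weighted_mean(midpoints, freqs)
    # Next generation: each frequency multiplied by mean family size.
    next_weights = [f * s for f, s in zip(freqs, family_sizes)]
    # Parents: each frequency divided by family size and multiplied by two.
    parent_weights = [2 * f / s for f, s in zip(freqs, family_sizes)]
    return (weighted_mean(midpoints, parent_weights),
            present,
            weighted_mean(midpoints, next_weights))


# Hypothetical data: three class intervals, larger sibships at lower scores.
midpoints = [85, 100, 115]
freqs = [30, 40, 30]
family_sizes = [4.0, 3.0, 2.0]

parents, present, nxt = generation_means(midpoints, freqs, family_sizes)
print(round(parents, 2), round(present, 2), round(nxt, 2))
```

Because a negative relation between score and family size is built into these hypothetical figures, the estimated means necessarily fall from parents to present subjects to the next generation. The sketch reproduces the arithmetic only; it says nothing about whether the assumptions behind the procedure hold.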

Such a procedure is based upon the 
assumption of a perfect parent-child 
correlation in both intelligence test 
score and family size. In neither case 
is the assumption consistent with 
available information. Parent-child 
correlations in intelligence test scores 
are generally in the .50’s (cf. 4, pp. 
318-320). With regard to fertility, 
parent-child discrepancies may arise 
from various sources, including the 
decrease in fertility differentials 
among socioeconomic classes and the 
social mobility of individuals. Avail- 
able data indicate that when persons 
move from one culture or subculture 
to another, their fertility tends to be 
intermediate between that of the 
group from which they have come 
and that of the group which they 
join. 

Another noteworthy methodological point pertains to the definition of mean family size. When size of sibship is recorded for each child and these sizes are averaged, the mean family size per child is obtained. On the other hand, when number of children in each family is recorded, as in a census survey, the average of these numbers gives mean family size per family. Thus if we have a sample of 5 families consisting of 1, 2, 3, 4, and 5 children, respectively, the median family has 3 children. But the median child, out of the total 15 in the sample, belongs to a family of 4. The corresponding means are 3 for families and 3.67 for children. The latter is the contraharmonic mean discussed by Jaspen (44, 45) and bears a direct arithmetic relation to the former. If X represents number of children in each family and N represents number of families, the mean per family is ΣX/N and the mean per child is ΣX²/ΣX. In making comparisons between mean family sizes of populations or segments of a population, it is obviously essential to insure that the same type of mean is employed. This point is especially likely to arise when results obtained with a child sample are compared to national census figures, since the former are usually expressed as mean per child, the latter as mean per family.
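
The five-family example can be checked with a short sketch. The function names are illustrative assumptions; the data are the 1, 2, 3, 4, and 5 children of the example, and the two formulas are the mean per family and the mean per child as defined in the text.

```python
from statistics import mean, median


def mean_per_family(sizes):
    """Sum of X divided by N: average number of children per family."""
    return sum(sizes) / len(sizes)


def mean_per_child(sizes):
    """Sum of X squared divided by sum of X: the contraharmonic mean."""
    return sum(x * x for x in sizes) / sum(sizes)


def sibship_sizes_by_child(sizes):
    """List the sibship size once for every child in every family."""
    return [x for x in sizes for _ in range(x)]


families = [1, 2, 3, 4, 5]
per_child = sibship_sizes_by_child(families)  # 15 entries, one per child

print(mean_per_family(families))            # 3.0
print(round(mean_per_child(families), 2))   # 3.67
print(round(mean(per_child), 2))            # 3.67, same value by direct averaging
print(median(families), median(per_child))  # 3 4
```

Averaging the sibship size over the 15 children gives the same figure as the contraharmonic formula, which is why a child sample and a census of families yield different "mean family sizes" from identical data.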

Attention should likewise be given to certain problems dealing with the choice and application of psychological tests in studies on fertility and intelligence. An intrinsic feature of the design of the Scottish surveys involved the transmutation of scores from a group test, administered to the entire sample, to Stanford-Binet IQ's obtained from a cross section of the larger sample. It was apparently felt that the scores of the entire sample would gain in meaningfulness if put in terms of such IQ's. Apart from the wider familiarity of the IQ, however, it is difficult to see what such a conversion accomplishes. Moreover, a number of specific objections can be raised against this procedure. Since different intelligence tests measure a somewhat different combination of functions, individuals cannot be expected to retain the same relative standing on such tests. Intelligence tests vary not only in score scale and in normative population,
but also in content. The reliability and validity of the converted scores depend upon the properties of the test which was actually administered, rather than upon the properties of the test into which the scores were converted. For example, if an individual test is more precise because of its better control of rapport and motivation, longer testing time, or any other favorable conditions, these characteristics are not transferred to the group test by transmutation of scores. Nor is the use of the traditional ratio IQ advisable for such purposes. Standard scores, expressed as deviation IQ's or in some other convenient form, have more to recommend them for [...]

[...] the exclusive use [...] intelligence in [...] questioned. [...] the [...] would app[...] correlations [...] as temp[...] means, [...] homoge[...] It wou[...] to discover [...] relat[...] or numerical aptitudes, and whether the rise in mean score found in follow-up surveys is uniformly high in all aspects of intelligence.



ANALYSIS OF CAUSAL RELATIONS 
Even with similar results, different investigators have offered varied interpretations of the correlation between intelligence and family size. It is especially important in this connection to differentiate between plausible hypotheses and empirically established facts, and to ferret out tacit assumptions which may underlie certain interpretations. For example, if intelligence is defined in terms of hereditary predispositions, it becomes logically impossible to investigate whether or not certain individual differences in intelligence may be related to gene constitution. It is more consistent with scientific method to employ definitions which do not prejudge the data, but which permit the maximum freedom of interpretation.

We may, of course, define the word "intelligence" to mean anything we choose. All studies on the relation of intelligence to family size, however, have employed psychological test scores as their index of intelligence. It is therefore evident that [...]ions must take such test scores as their point of departure. When intelligence is defined in such terms, we may recognize three dis[...]ological mechanisms whereby [...] between [...] and family size might [...] ([...] 3, 59 [...]). It is understood, [...], that the actual causal relations [...] involve [...] combination [...] all three of these mechanisms. [...], there may be inherited struc[...] (neural, glandular, etc.) [...] constr[...], reducing [...] of behavior and limiting [...] development [...] measured by current intelligence [...]. The less able parents would thus transmit their hereditary limitations to their offspring. The obtained correlations would then result from the fact that, within a given culture or subculture, persons with inferior heredity [...] to have more offspring.
A second explanation attributes individual differences in children's abilities to psychological differences in the environments provided by parents of varying intellectual levels. In this case, the correlation between family size and intelligence of offspring would again result from a tendency for the less intelligent parents to have more children, but heredity would not be involved. Differences in intellectual level among the parents, as well as among the offspring, could thus have resulted from environmental factors.

In this connection, mention should also be made of the point of view represented by Vernon (101). In a discussion of the use of intelligence test scores in population studies, he urges the need for caution in interpreting such scores because of the effect of differences in cultural patterns upon test performance. Such cultural differences, however, may affect not only test scores but also the individual's over-all intellectual development. The real question pertains not to inadequacies of available measuring instruments but rather to the interpretation of the existing behavioral differences.

It may be added that the individual's intellectual development may be influenced, not only by the nature and extent of direct intellectual stimulation offered by his home, but also by emotional and motivational factors deriving from the "social climate" of the home. There is a growing body of data pertaining to differences in child-rearing practices in various socioeconomic levels and other subcultures (4, Ch. 23 and p. 734). Some of these differences, such as the degree to which verbalization and exploratory behavior are encouraged or discouraged, are likely to influence the child's intellectual development.

A third possible interpretation of the obtained correlations between intelligence and fertility is based upon size of family itself as a causal factor. For example, a larger family, at least in certain socioeconomic levels, would reduce the per capita funds available for education, recreation, suitable housing, proper food, medical attention, and other environmental requisites. From a psychological viewpoint, another important factor is the degree of adult contact provided in families of different sizes. Available evidence suggests, for example, that such contact may be the most important single factor in linguistic development (4, pp. 335-339; 62). And it is well known that verbal ability plays a major role both in educational progress and in intelligence test performance. Whatever the specific factors involved, the causal mechanism under consideration is independent of the intellectual level of parents. A crucial test of this hypothesis would thus be found in the correlation between size of sibship and intelligence of offspring, when intellectual level of parents is held constant.

One additional point may be con- 
sidered. Insofar as family size itself 
may be a factor in intellectual de- 
velopment, the general decrease in 
family size in the period covered by 
most follow-up surveys is notewor- 
thy. Quite apart from fertility dif- 
ferentials, such a decrease may have 
operated as one factor in the observed 
rise in intellectual level. 

It is evident that the three hypotheses differ significantly in both their theoretical and practical implications. On the basis of existing data, it is impossible to choose among them or to determine the relative contributions of each of the three types of influences. The complex interaction of many variables makes analysis of causal relations difficult in this area. A few of the previously cited results, however, have a bearing upon causal interpretations. Thus Sutherland's study of fatherless children suggested that family size per se is insufficient to account for the obtained correlations between family size and intelligence (88). On the other hand, the findings on twins in both the Scottish (83) and French (36, 90) surveys are consistent with interpretations in terms of adult contact. The results of Tabah and Sutter (36, 90) on inter-sib interval likewise seem to support such an explanation. The fact that studies of parents have so far failed to demonstrate a negative correlation between parental intelligence and number of offspring is also [...] with the third hypothesis outlined above. Vernon (101) recommends the study of orphans and foster children whose true parents' families differ in size. Such an approach would undoubtedly provide crucial data on the first hypothesis, which con[...] the contribution of hereditary factors to the obtained correlations [...] intelligence and family size. [...] type of investigation, however, [...] be handicapped by a number of practical obstacles, such as the limit[...] in number and range of cases. Moreover, family [...] orphans and foster children would be artificially restricted by such conditions as death of one or both parents and illegitimacy.



A study specifically designed to test the role of parental contact in the association of intelligence and family size is reported by Nisbet (69, 70). Three types of data were obtained on several thousand Aberdeen school children. First, in the effort to gauge the contribution of language development to the correlation between family size and performance on the ordinary verbal type of intelligence test, the latter correlation was found with language test scores partialled out. This partial correlation was only -.04. When family size and language test score were correlated, with intelligence test score partialled out, a higher correlation (-.11) was obtained. Secondly, verbal intelligence tests yielded significantly higher correlations with family size than did such nonverbal tests as Raven's Progressive Matrices. Thirdly, the correlation between intelligence test scores and family size rose with age. The latter finding was obtained in both a cross-sectional study of the large sample and a longitudinal study of a small sample. From the results of these three approaches, Nisbet concluded that part of the negative correlation between family size and intelligence may be attributed to the effect of sibship size upon verbal development, which in turn influences intellectual development. In a later study on 288 adult [...], Scott and Nisbet (80) again [...] that the negative correlation between size of sibship and intelligence test score is lower for nonverbal than for verbal tests. Such results corroborate Nisbet's findings on the child sample described above, and suggest that the environmental factors associated with family size may affect test performance well into adulthood.
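
The operation of "partialling out" a third variable can be sketched as follows. The formula is the standard first-order partial correlation; that Nisbet's figures were computed this way is an assumption, and the three zero-order correlations below are hypothetical values, not his published data.

```python
from math import sqrt


def partial_corr(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Hypothetical zero-order correlations (illustrative only):
r_size_iq = -0.30    # family size vs. verbal intelligence test score
r_size_lang = -0.30  # family size vs. language test score
r_iq_lang = 0.80     # intelligence test score vs. language test score

# Family size vs. intelligence test score, language score partialled out.
print(round(partial_corr(r_size_iq, r_size_lang, r_iq_lang), 3))
```

When the third variable is strongly related to both of the others, the partial correlation shrinks toward zero, which is the pattern the first of Nisbet's three analyses relies upon.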

It should be noted that the psychological effect of family size upon intellectual development may be curvilinear rather than rectilinear. Moreover, the direction in which increasing family size influences intellectual development may itself differ with concomitant psychological and social circumstances. Attention should likewise be given to the possible effects of family size upon nonintellectual characteristics, such as socialization, cooperation, emotional security, leadership, and other aspects of interpersonal relations. Available research on this question has yielded conflicting and ambiguous results (9, 10, 11, 26, 27, 32, 55, 57, 86). Some studies provide evidence that certain desirable emotional characteristics, as well as acceptance by associates, may be positively related to membership in relatively large families (9, 11, 27). Most of the results, however, are complicated by the correlation between indices of social adjustment, on the one hand, and both intellectual and socioeconomic level, on the other. Differences in family size among ethnic subgroups within a sample are another source of confusion in such data.

In recognition of the methodological and interpretative complexities of the problem, there is a growing interest in the design of more nearly definitive investigations on the relationship between intellectual level and fertility. Ideally, such investigations should begin with the testing of young people prior to their educational and vocational differentiation, i.e., after all have completed a uniform period of required schooling. Preferably the test should consist of a differential aptitude battery yielding a profile of scores rather than a single global measure. The subjects should be followed up until all or nearly all of their families are completed. Age of both parents at the birth of their first and last child should be recorded. Data should also be kept on deaths, unmarried persons, and childless marriages. Information should likewise be gathered regarding occupation, income level, and amount of subsequent education for each member of the group. It would also be of interest to obtain indices of social mobility, such as changes in occupational, educational, or income level within the subject's own life, as well as differences between his status and that of his parents.

Additional questions can be answered if test scores of the children of these persons are studied; but many problems can be solved even prior to this step. From a practical viewpoint, such a program is not unrealistic, especially in nations where uniform psychological tests are administered in the school system and where detailed census data are regularly collected. From a theoretical standpoint, this approach would help to separate the many interrelated variables which are now intricately intertwined, and should thus bring us closer to a causal interpretation of the empirically observed relationship between intelligence and family size.


REFERENCES

5. Anonymous. Fertility and education level of parents. Stat. Bull. Metrop. Life Insur., 1945, 26, No. 11, 6-7.

6. Baltzell, E. D. Social mobility and fertility within an elite group. Milbank Mem. Fund Quart., 1953, 31, 411-420.

7. Berent, J. Fertility and social mobility. Popul. Stud., 1952, 5, 244-260.

8. Blackburn, J. Family size, intelligence score, and social class. Popul. Stud., 1947, 1, 165-176.

9. Bonney, M. E. A study of the relation of intelligence, family size, and sex differences with mutual friendships in the primary grades. Child Develpm., 1942, 13, 79-100.

10. Bonney, M. E. Relationships between social success, family size, socio-economic home background, and intelligence among school children in grades III to V. Sociometry, 1944, 7, 2[...].

11. Bossard, J. H. S., & Boll, Eleanor S. Security in the large family. Ment. Hyg., N. Y., 1954, 38, 544[...].

12. Bradford, E. J. G. Can present scholastic standards be maintained? Forum Educ., 1925, 3, 186-198.

13. Burks, Barbara S., & Jones, H. E. A study of differential fertility in two California cities. Hum. Biol., 1935, 7, 539-554.

14. Burt, C. Intelligence and fertility. London: Hamilton, 1946.

15. Burt, C. Family size, intelligence, and social class. Popul. Stud., 1947, 1, [...]-186.

28. Dawson, S. Intelligence and fertility. Brit. J. Psychol., 1932, 23, 42-51.

29. Dawson, S. Environmental influence on mentality. Brit. J. Psychol., 1936, 27, 129-134.

30. Dinkel, R. M. Occupation and fertility in the United States. Amer. sociol. Rev., 1952, 17, 178-183.

31. Duncan, O. D. Is the intelligence of the general population declining? Amer. sociol. Rev., 1952, 17, 401-407.

32. Ekstrand, Ruth. The variation of behavior problems with size of family. Smith Coll. Stud. soc. Work, 1931, 1, [...].

33. Emmett, W. G. The trend of intelligence in certain districts of England. Popul. Stud., 1950, 3, 324-337.

34. Eydt, [...]. Auslese und Ausmerze in der
i | ‘olk u. Rasse, 1938, 13, 


nt increases and 
4 (91 1 i it anges I mental level. ] 
ea 6 2), a. Psychol. Monogr., 1946, No. 10. Pp. 
B. Is national intellig e 6 LLE, R., HENRY, L., TaBan, L., St 

] t , GIRARD, os & 

iu intellectuel 

laire: la eee 


3 ond g, nination aptitudes; ‘influence 
“ATTELL, R — idv of the nat il d ur nstitutionnels, famili- 


l 
] 
reserves O t 


ist. nat. d'études 


{oil | aux, 1 
1937, 11, 127-137. dém iphiques: 1vaux et document 


ATTELL, R. We Some further relations thier 23, 1954. Pp. 294. 
between ntelligence, fertility ind HEINEN, : hulleistungen, viter- 
ocio-economic factors. Suge Rev., licher erul, un Kinderzahl. £. 
937, 29, 171-179 ne h. Vererb.- 1 -onst.- Lehre, 1938, 
Intelliger ; 
445, 36, 126-127. 3 IEUVYER, RON, H., PrérRoN, MMe. 
niveau intellec- 
scolaire: 
prediction. sugen. Rev., . : 1éte itional ns l'enseign 
136-148. prima In nat. d'études 
CHAPMAN, J. C., & WiGGINs, gra ues: 1vaux et documents, ( 
lation of family size to intelligen hier 13, 1950. Py 
HIMMELWEIT, HitpE T. _ Intelligen 
mily; their relatior 
ll ip ol high educat 
ird. Eugen. Rev., 1948, 40, 77-88. 
tudy. he rential bir rat IRS‘ NATHANIEL, D. M. An experi 
Amer. Statist. « , , £53 n ‘ tudy upon three hundred 
159, school children over a six year period 
DAMRIN, Dora E mily size and sil renet. Psy . Monogr., 1930, 7, 493- 
ling age, sex, and 
to certain aspects of adjustment. J eae ; 1e influence of schoolin 
soc. Psychol., 1949, 29, 93 upon IC rheoria, 1951, 17, 61-88 
Dayton, N. A. nfluence of size of 2. HUTTON, itelligence quotients and 
family upon the characteris ! | d é 
mentally deficient. Survey : 
retarded children in the public schools J , A. J. Urbanization and fertil 
of Massachusetts. Amer. J. Psychiat., Amer. J. Sociol., 1942, 48, 48-00. 
1935, 91, 799-832. . JASPEN, N. A class of means. Paper 








read at East. Psychol. Ass., New York, April, 1954.

45. Jaspen, N. Means and their standard errors. Paper read at Amer. Psychol. Ass., New York, September, 1954.

46. Kelly, E. L. Consistency of the adult personality over twenty years. Amer. Psychologist, 1955, 10, 659-681.

47. Kiser, C. V. Group differences in urban fertility. Baltimore: Williams & Wilkins, 1942.

48. Kiser, C. V. Exploration of possibilities for new studies of factors affecting size of family. Milbank Mem. Fund Quart., 1953, 31, 456-479.

49. Kiser, C. V. The Indianapolis fertility study: an example of planned observational research. Publ. Opin. Quart., 1953-54, 17, 496-510.

50. Kiser, C. V., & Whelpton, P. K. Social and psychological factors affecting fertility: II. Variations in the size of completed families of 6551 native-white couples in Indianapolis. Milbank Mem. Fund Quart., 1944, 22, 72-105.

51. Kiser, C. V., & Whelpton, P. K. Résumé of the Indianapolis study of social and psychological factors affecting fertility. Popul. Stud., 1953, 7, 95-110.

52. Kuczynski, R. R. British demographers' opinions on fertility, 1660-1760. Ann. Eugen., 1935, 6, 139-171.

53. Lee, E. S., & Lee, Anne S. The differential fertility of the American Negro. Amer. sociol. Rev., 1952, 17, 437-447.

54. Lentz, T. F., Jr. Relation of IQ to size of family. J. educ. Psychol., 1927, 18, 486-496.

55. Levy, J. A quantitative study of behavior problems in relation to family constellations. Amer. J. Psychiat., 1931, 10, 637-654.

56. Locke, N. M., & Goldstein, H. The relation of birth order, age of mother, and size of family to intelligence. J. Psychol., 1937, 3, 89-96.

57. Loomis, C. P., Baker, W. B., & Proctor, C. The size of the family as related to social success of children. Sociometry, 1949, 12, 313-320.

58. Lorge, I. Schooling makes a difference. Teachers Coll. Rec., 1945, 46, 483-492.

59. Lorimer, F. Trends in capacity for intelligence. Eugen. News, 1952, 37, 17-24.

60. Lorimer, F., et al. Culture and human fertility. Paris: Unesco, 1954.

61. Lorimer, F., & Osborn, F. Dynamics of population. New York: Macmillan, 1934.

62. McCarthy, Dorothea. Language development in children. In L. Carmichael (Ed.), Manual of child psychology. New York: Wiley, 1954.

63. MacMeeken, Agnes M. The intelligence of a representative group of Scottish children. London: Univer. of London Press, 1939.

64. Maxwell, Aline, & Huestis, R. R. Student test-score rank and family size. J. Hered., 1930, 21, 211-215.

65. Maxwell, J. Intelligence and family size of college students. Eugen. Rev., 1951, 42, 209-210.

66. Maxwell, J. Intelligence, fertility, and the future: A report on the 1947 Scottish Mental Survey. Eugen. Quart., 1954, 1, 244-274.

67. Mishler, E. G., Westoff, C. F., & Kelly, E. L. Some psychological correlates of differential fertility: a [...] study. Amer. Psychologist, 1955, 10, 319. (Abstract)

68. Moshinsky, Pearl. The correlation between fertility and intelligence within social classes. Sociol. Rev., 1939, 31, 144-165.

69. Nisbet, J. D. Family environment: a direct effect of family size on intelligence. London: Cassell, 1953.

70. Nisbet, J. D. Family environment and intelligence. Eugen. Rev., 1953, 45, 31-40.

71. O'Hanlon, Gerard S. A. An investigation into the relationship between fertility and intelligence. Brit. J. educ. Psychol., 1940, 10, 196-211.

72. Papavassiliou, I. T. Intelligence and family size. Popul. Stud., 1954, 7, 222-226.

73. Pearson, K., & Moul, Margaret. The problem of alien immigration into Great Britain, illustrated by an examination of Russian and Polish Jewish children. Ann. Eugen., 1925, 1, 5-127.

74. Penrose, L. S. The supposed threat of declining intelligence. Amer. J. ment. Defic., 1948, 53, 114-118.

75. Penrose, L. S. Genetical influences on the intelligence level of the population. Brit. J. Psychol., 1950, 40, 128-136.

76. Penrose, L. S. Propagation of the unfit. Lancet, 1950, 259, 425-427.

77. Roberts, J. A. Fraser. Intelligence and family size. Eugen. Rev., 1939, 30, 237-247.

78. Roberts, J. A. Fraser. The negative association between intelligence and fertility. Hum. Biol., 1941, 13, 410-412.

Roserts, J. A. FRASER, et al. Studies 
on a child population. III. Intelli- 











gence and family size. Ann. Eugen a f SOA /Series A 
1938, 8, 178-215 il tud ( 7. New York: 

80. Scort, E1LEEN M., & Nispet, | nited Nz ' } 

| ] 7 D NATION P F Social 


‘ ! ' ’ ‘ 1, 1? , ' ; 
Intelligence and family s In an adul 
I Ol t 1 a Pro- 


i nple ku 


COMMERCE, 
Fertility 


Rep 


] il tract 448, Series 


ScoTTisH CoUNCIL FOR RESEARCH 
Ept TION. The trend of Scottish in ~. ; 
r. of lor NITEI ATES r. Ol OMMERCE, 


Pt. 2, 120 
TERMAN, I 
New York 
1946-1958, 
1t\ and pa- 


rental int y 
1928, 40, 671-67. 

WILLOUGHBY, R 
GUERITI The corre 


» 
& CooGcan, Mar- 


lation between in- 


R., 


telligence and fertility Hum. B 


- the Scottish IT \ ugen 
, 1950, 41, 163 1940, 12, 114-119 
UDDENHAM, R. D. Soldi ntelliger WitLoucuBy, R. R., & Gooprig, 
World Wars I and II. Amer. P y- MIRANDI. Neglected factors in the 
chologist, 1948, 3, 54-56 difierential birth rate problem. J 
UnitEp Nations, Dept. OF Soctat genet ychol., 1927, 34, 373-393. 
AFFAIRS. Population Division. The 


determinants and consequences of popu- Received June 6, 1955 





PSYCHOLOGICAL BULLETIN
Vol. 53, No. 3, 1956

STUDIES OF BRAIN DYSFUNCTION IN SCHIZOPHRENIA¹,²

GLEN A. BRACKBILL
Veterans Administration, Palo Alto


¹ From the Veterans Administration Hospital, Palo Alto, California.
² I would like to thank Mary Ann Swanson and Elizabeth Grisak for their helpful assistance.

In 1924, Charles Dunlap (24) stated quite unequivocally that there were no organic lesions or histopathological findings which could be shown to be characteristic of schizophrenia. According to him the changes which had been reported were not essentially different from nerve cell changes found in nonpsychotic persons. This rather sweeping conclusion was based on his comparison of the brains of eight schizophrenic patients with those of four "control" subjects. His control group consisted of a bootlegger who had been shot through the lung and died shortly thereafter, two men who died of arsenic poisoning, and one woman who died of peritonitis. It appears, contrary to Dunlap's statement, that there is room to question the "normality" of such a group. At any rate, this pronouncement apparently had an inhibiting effect on further investigations into the role of organic factors in schizophrenia. Few such investigations were undertaken in the decade following Dunlap's authoritative statement. In 1930, Freeman (33) quoted Dunlap's conclusions but was willing to report finding a deficiency of catalytic iron in the nerve cells of schizophrenics. Here again seems to be evidence of the profound effect of a statement backed by an authority figure which blocked progress in an area deserving of continuing experimental study.

Perhaps the electroencephalographers were unimpressed by authority or were not aware of it. At any rate, they appear to have been, in large part, responsible for rearousing interest in the search for an organic factor in schizophrenia. In the process of recording the brain waves of various groups of people, they found among schizophrenics waves resembling those found in cases of known brain damage. Histopathologists and biochemists have followed up these leads so that the recent psychiatric literature presents a number of provocative studies. It is the purpose of this review to present some of these recent findings and to suggest some hypotheses.

While it is probably true that most of the work in the physiology, genetics, and psychology of psychoses is at least indirectly related to the subject of brain dysfunction, the electroencephalographic, histopathological, and biochemical studies are both directly relevant and of sufficient importance in themselves to merit concentrated attention. The review will include these studies as well as several psychological and physiological studies which focus specifically on the question of brain disorder in schizophrenia. Studies made within the last twenty to twenty-five years will be included.

It is felt that psychologists could make a worth-while contribution to the studies of possible central nervous system pathology among schizophrenics. Measurement of organic damage by various kinds of psychological tests has been rewarding to the extent that tests have been shown to be useful in research and in theoretical formulations about the nature of thinking in cases of cerebral damage. In many clinics and hospitals neurologists have come to depend on psychological diagnostic tests in the study of central nervous system involvement. It is hoped that this review will stimulate research interest on the part of psychologists who are concerned with the physiological correlates of behavior.

EEG STUDIES

In 1937, Travis and Malamud (76) reported a study unfavorable to the hypothesis that schizophrenics show patterns different from the normal. These investigators were apparently only secondarily interested in this possibility since they were more interested in testing some of Berger's original formulations about the nature of the EEG. However, they did find no differences in either amplitude or frequency of the waves of nine schizophrenics compared with those of a group of seventy-five normals and a group of stutterers. They were aware of the small size of their group of schizophrenics, and are tentative in their conclusions.

In the following year, Gibbs, Gibbs, and Lennox (34) found that epilepsy is present about three times as often among schizophrenics as in the general population. Gibbs et al. point out that their experience shows that the majority of schizophrenics they have tested show abnormalities of rhythm. They found both fast and slow EEG patterns. Although they do not present statistics, they were impressed by the similarity of many of the records of schizophrenics to those of psychomotor epilepsy. This finding should arouse some interest in the possibility of an organic factor in those cases in which an epileptic EEG is found. The meaningfulness of such a question would, however, depend upon the acceptance of an organic basis for psychomotor seizures.

Jasper, Fitzpatrick, and Solomon (48) studied the EEG's of 82 schizophrenics, so diagnosed by agreement of five psychiatrists (no small accomplishment in itself). They also included a group of 51 epileptics and 60 normal subjects. They found, as is expected of schizophrenic records by now, no specific EEG pattern, but rather, a wide range of variability. Contrary to the study of Travis and Malamud, they found a variety of anomalies to a more extreme degree and in a greater number among schizophrenics than normal subjects. Specifically, they report that 23 per cent of the cases diagnosed as schizophrenic showed either clinical or electroencephalographic evidence of brain activity similar to that characterizing the epileptic. They are somewhat hesitant in their conclusions about the role of organic factors in producing the abnormal EEG's. They state, "15% of our group of patients diagnosed as schizophrenic may well be suffering from mental disorders due to head injury, mental deficiency, or other organic defects of the brain" (48, p. 849).

Another study involving 52 chronic schizophrenics and 500 control cases reports essentially the same results. Davis and Davis (19) also feel that schizophrenic patients do not have a typical "schizophrenic EEG" in the same sense as, for example, the spike and slow wave of petit mal epilepsy. They found variations in wave patterns outside the range of normal subjects. They come to much the same conclusions as Jasper et al., but are more cautious in their concluding statement. "The quality of the atypical variations resembles that of the changes which occur in the electroencephalograms of persons known to have brain lesions or who suffer from the various forms of epilepsy, or normal persons during sleep or when breathing an inadequate supply of oxygen" (19, p. 1019). They permit the reader to draw his own conclusions about the meaning of their findings in regard to possible organic factors in some schizophrenics.

In 1940, Rubin (70) published
EEG studies of 14 schizophrenics, 2
manics, and 1 case of traumatic
psychosis. He used the per cent time
alpha as a measure. By studying the
distribution of per cent time alpha
over the various areas of the cortex,
he felt that he was able to detect the
presence of atrophy in these cases.
His diagnosis was confirmed in 10
of the cases by pneumoencephalograms.
Air studies had not been done
on the remaining subjects. Rubin
states not only that there was central
nervous system pathology present,
but that the nature of this
pathology was atrophy. Rubin's
results were confirmed by Moore,
Nathan, Elliott, and Laubach (55).
In the majority of their sixty cases
they found evidence of atrophy,
enlargement of the ventricular system,
and cisternae. A study of the films
suggested that there was a selective
atrophy involving the parietal lobe
and insula. Unfortunately they do
not report on the frequency of
occurrence. It could be inferred that at
least 31 of the cases showed such
atrophy. Again it is difficult to assess
these findings since the authors used
no control group and hence could not
evaluate the frequency of atrophy
among nonschizophrenics.

Davis (20) studied the EEG's of
132 schizophrenics in an attempt to
work out a classification system for
schizophrenic EEG's. She found that
she could isolate three general types
of patterns. Group I showed
essentially no differences from the
normal. Group II showed dysrhythmias
without convulsive characteristics.
Group III contained records which
suggested the possibility of localized
or generalized irritative lesions, cysts,
atrophy, or other pathological
conditions. This study again suggests
that an appreciable number of
schizophrenics show electrical
activity of the cerebral cortex which
has long been associated with some
form of structural brain damage.

Finley and Campbell (31) studied
the EEG's of 500 schizophrenics and
250 normals using a triple
classification similar to that of the
previously mentioned investigator.
They categorized the records as
normal, borderline, or abnormal. When
these classifications were applied to
their two groups they found 40 per
cent of the schizophrenics had normal
records while 70 per cent of the
normal group had normal records.
Twenty-eight per cent of the
schizophrenics had abnormal records
while 7 per cent of the normals had
abnormal records. The remaining
subjects had records in the borderline
category. Thus the incidence of
abnormal tracings among
schizophrenics, counting borderline
and abnormal records together (60 per
cent as against 30 per cent), was
twice that of the normal group.
Finley and Campbell found that 30 per
cent of the patients' records showed
tracings indicative of a disorder of
function in the central nervous
system. They point out, however, that
the EEG pattern is not diagnostic of
schizophrenics since the kinds of wave
activity found are not unique to this
group.

Kennard and Levy (50) found
among 100 schizophrenic patients
exactly the same 60 per cent of
abnormal patterns as reported by
Finley and Campbell. They looked for
correlations between the abnormal
patterns and the patients' clinical
pictures. The incidence of abnormal
EEG's was higher for those patients
with early onset and prolonged
illness. If the illness had lasted three
years or less, the incidence of
abnormal tracings was only 42 per cent
as compared to 72 per cent if the
illness had persisted over ten years.
This relationship also seemed to hold
in regard to the degree of intellectual
impairment shown by the
Wechsler-Bellevue Test. It is suggested
that these findings are consistent with
the idea that “...the schizophrenic
process can [be] thought of as a
progressive disorder which ultimately
profoundly affects the performance of
all organic systems including that of
the cerebral cortex” (50, p. 421).

Grinker and Serota (35) implicate
the hypothalamus in their EEG
study of schizophrenia. Their report
is in the nature of an hypothesis
resulting from the observation of some
autonomic reaction in schizophrenia.
There was little electrical activity in
the hypothalamus and cortex when
their patients were subjected to
external cold. They also found that
electrical stimulation of the
hypothalamus failed to arouse activity
in that structure or in the cortex.
Among neurotics and normals an
increase in activity was apparent.
They suggest that these findings
indicate a physiological failure at the
cortical level because the usual
hypothalamic driving force is lacking.
The inference could be made from
these formulations that the
thalamo-cortical circuits are not
functioning so that the cortex is
deprived of impulses from lower
centers. The Tulane group headed by
Heath (38) has also suggested that
subcortical centers may be implicated
in schizophrenia. They report spiking
activity from areas near the thalamus
in the septal region. EEG's from leads
implanted by the Horsley-Clarke
apparatus in subcortical areas of
schizophrenic patients often showed
the spiking activity generally believed
indicative of structural damage.
They apparently felt that if they
succeeded in eliminating the spike by
administering electric shock through
the subcortical electrodes, the patient
had a better chance for recovery than
if the spike activity continued.

One other EEG study is of
incidental interest. Turner and
Lowinger (77) saw a marked difference
in prognosis for recovery of
schizophrenics given shock treatment,
depending on the preshock EEG
pattern. None of the patients with
abnormal preshock waves was able to
leave the hospital after treatment.
One-fourth of those with normal
patterns were discharged following
the treatment. Again, a relationship
between the presence of abnormal
wave activity and malignancy is
suggested.

In summary, it is interesting to
note the progressive changes in the
significance ascribed to EEG findings
in the research on schizophrenia. The
early studies emphasize lack of
consistency in the tracings. In the
later studies greater confidence is
placed in the idea that EEG studies
have demonstrated brain dysfunction
in an appreciable number of cases of
schizophrenia, despite variability in
the tracings. Finally, some
investigators have been willing to
state the nature of the difficulty,
i.e., atrophy of the cortex and/or
failure of thalamo-cortical circuits.
The histopathologists have been much
more specific in their statements
regarding the nature of the brain
pathology found. This latter group of
studies will be reviewed next.


HISTOPATHOLOGICAL STUDIES 


One of the persistent proponents 
of organic factors in schizophrenia is 
Eugen Bleuler (12). His monograph 
on the schizophrenias is no doubt still
the outstanding text in the field. It
was his early clinical observations of
sensory disorders among such pa- 
tients that led him to suggest organic 
dysfunction of the brain. In 1930, 
Bleuler wrote a provocative essay 
(13) in which he tried to sort the 
psychogenic from the physiogenic 
in schizophrenia. It seems worth- 
while to quote part of his conclusions 
in regard to the role of the central 
nervous system in schizophrenia: 
“Many cases of stupor, with their general
prostration of the elementary psychic func- 
tions, conception and train of thought, often 
point clearly to brain pressure and, on autopsy 
tense edema of the pia or brain swelling is 
found. In the various forms of such deliria, 
the fundamental similarity of certain symp- 
toms or of the whole picture to other physio- 
genic conditions, intoxication, fever psychoses, 
epileptic absences, meningitis, encephalitis, 
cannot be denied, and in all such cases we also 
find in the autopsy histological alterations of 
the brain tissue which show some uniformity. 
But in all chronic cases too, decreases in the 
amount of ganglion cells and certain changes 
in the glia, furnish proof that we are in the 
presence of a brain lesion...” (13, p. 206).


There are serious difficulties in ex- 
perimental histopathology, many of 
which have not been overcome. For 
instance, as Spielmeyer (74) points 
out, it must be remembered in using 
necropsy material that the diseases 
which cause death may themselves 
bring about changes in the brain. 
Also, it is not known what the agonal 
process itself may do to brain cells. 
Neither of these variables has been 
thoroughly explored. It might be 
added that although biopsy material 
can now be obtained easily at the 
time of prefrontal lobotomy, control 
material from normal subjects is not 
so readily procured. If control ma- 
terial is not obtained, it is impossible 
to assess the extent to which the re- 
sults may be attributed to the experi- 
mental procedure. Many of the stu- 
dies to be reviewed in this section 
have not overcome these difficulties 
altogether. Hence, the degree of
confidence to be placed in the conclusions
of these studies is necessarily limited, 
as is also the generalizability of the 
results. 

In 1930, Spielmeyer (74) recog- 
nized these potential sources of error, 
but still felt justified in concluding 
that characteristic histological 
changes were evident in schizo- 
phrenia. The finding he emphasizes 
is the loss of cells in the cortex. He 
also reports changes in glial cells and 
the presence of destruction products. 
He concludes that there is sufficient 
evidence to state definitely that 
schizophrenia is an organic process. 

In 1932, Bamford and Bean (6)
studied a series of cases they describe
as acute dementia praecox. These
authors report on three cases but 
base their conclusions on an unspeci- 
fied number of additional cases. 
Their conclusion is quite different 
from Spielmeyer’s. In fact, this study 
is one of the few since 1930 reporting 
completely negative results. (It 
should be remembered, of course, 
that negative results are less likely 
to be reported than are positive find- 
ings.) 

As is often found in the psychiatric 
literature, there are a number of case 
studies reported. Several of these 
will be reviewed even though they do 
not represent well-controlled nor 
truly experimental studies. It is 
perhaps not necessary to state that 
fruitful hypotheses may be derived 
from observation of the single case. 

Ferraro (29) reports two cases di- 
agnosed dementia praecox. He found 
swelling of the oligodendroglia in 
both cortical and subcortical tissue, 
but enlargement was most prominent 
in frontal and temporal areas. He 
also noted demyelinization, perivas- 
cular infiltration, and the presence 
of gitter cells. These were taken to be 
indications of a degenerative process 
not to be expected in the normal
brain. Ebaugh, Barnacle, and
Neuberger (25) detected histological
alterations in fatalities resulting from
electroshock therapy. They found
changes in astrocytes and in the
neuroglia, and suggest that although
these modifications may have been
related to the shock they were not
limited to areas of strongest current.
Their conclusions are not explicitly
stated, but they imply that shock
alone could not account for the brain
damage. Ferraro (30) reports
another case which had been diagnosed
as schizophrenia for more than ten
years. The clinical picture was
clearly that of a schizophrenic
psychosis but on autopsy a diffuse
demyelination of the brain was found.
The disease was apparently so
widespread and progressive that it
could scarcely be attributed to a
sudden illness. Polatin, Eisenstein,
and Barrera (66) describe two typical
cases of chronic schizophrenia,
similar to those of Ferraro. Their
discovery of slow wave activity in the
EEG's was the first clue to the
presence of organic damage;
pneumoencephalography revealed
cerebral atrophy; and the Rorschach
suggested organic brain disease.
Biopsy studies revealed what the
authors considered to be irreversible
changes in the cerebral cortex.

Roizan, Moriarity, and Weil (68)
examine the case of a 34-year-old
woman who died after a brief acute
psychosis diagnosed as catatonic
schizophrenia. Here, as in Ferraro's
research, study of the brain tissue
revealed a demyelinating process in
the central nervous system. These
authors interpret their findings not
as indicating that schizophrenia is an
organic process but that impaired
brain function acts as a precipitating
factor for a schizophrenic reaction.

Holt and Tedeschi (45) and Van
der Horst (78) report, as did the
previous investigators, the presence
in individual cases of
demyelinization, microglial and
oligodendroglial hyperplasia, loss of
cells, and vascular infiltration.
Additionally, Holt and Tedeschi
detected cytoplasmic involvement in
the nerve cells themselves along with
nuclear disturbances and lipoid
deposits.

One of the most interesting and
best controlled studies was made in
1944 by Kirschbaum and Heilbrunn
(51). They took biopsies from the
frontal lobes of eleven chronic
schizophrenics and also from three
normal subjects. Biopsies from several
cats and rats provided further control
material. For the experimental
material they report ganglion cell
degeneration as well as alterations in
glial cells and blood vessels. These
changes are similar to those found in
chronic intoxication and metabolic
disorders. Comparison of experimental
and control material indicated
that the results could not be
attributed to the experimental
procedures. This study seems most
impressive because many of the
methodological flaws of other studies
were avoided: The tissue was taken
from the living organism and studied
immediately, thus ruling out the
complicating effects of the agonal
process as well as the disease which
might cause death. Also, the use of
controls in this study held constant
any effects the anesthetic and surgical
procedures might have had.

Another well-executed biopsy
study is reported by Elvidge and
Reed (27). They used material from
19 psychotic patients, 13 of whom
were schizophrenic. Biopsies were
also obtained from 16 control
subjects, mostly epileptics, and from
14 laboratory animals. They too report
for the schizophrenic subjects changes
in glial cells, with hypertrophy and
swelling. In some cases the
oligodendroglia showed nuclear
modification. These changes were most
apparent in the white matter; in some
cases they were generalized, while in
other cases they appeared more patchy.
Similar changes were found for those
control cases in which the patient
showed clouding of consciousness
between epileptic seizures, but not for
those cases in which the patient
remained clear and oriented between
seizures. A number of the psychotics
were followed over periods of from
one to two years, and the biopsies
were repeated. The findings were the
same as before. It is unfortunate that
none of the cases had remitted
between biopsies, for if it could have
been demonstrated that the changes
had slowed down or ceased in
recovered cases, it would have been an
impressive argument for the importance
of organic factors in schizophrenia.
Elvidge and Reed suggest that their
results indicate disturbances or
interruptions of commissural pathways
in the brain, and that it is this
“disruption” of impulses to and from
various parts of the brain which causes
disturbances in intellectual and
emotional reactions.

Winkelman and Book (80) made
a post-mortem study of 10 cases of
“typical” schizophrenia. Their
findings are listed as (a) focal cell
loss, (b) a general decrease in the
number of nerve cells, especially in
the anterior half of the brain, (c)
chronic cell diseases, including cell
shrinkage and ghost cells, (d) amount
of fat in cells usually greater than
expected, (e) increase in astroglia,
and (f) mild general demyelinization
in subcortex. They claim that the
changes found in the brains of their
patients were much more severe
quantitatively than those usually
found in nonpsychotic cases.

From another post-mortem study
come similar results. Rupp and
Wilson (71) studied 37 patients, all
under fifty years of age. Biopsies
revealed loss of cells, gliosis,
atrophy, edema, vascular lesions, and
areas of softening. The authors do not
report data for control subjects.

Probably the most persistent
investigators in the histopathology of
schizophrenia are Papez, and Papez
and Bateman (57, 58, 59, 60, 61, 62,
63). Their methods and techniques
are carefully detailed and appear to
meet the standards of good research,
with the exception that the extent to
which they have studied control
material is not always made explicit.
In 1944, Papez (57) reported finding
dark-staining particles, inclusion
bodies, in the cytoplasm of nerve
cells. According to him, these
particles were not artifacts produced
by the technique but were living
microorganisms of a parasitic nature.
In 1949, Papez and Bateman (59) made
another study of biopsies. In their
sample of 50 patients, there were 42
schizophrenics, 6 manic depressives,
and 2 syphilitics. They identified
three stages of cell disease. In the
first stage, the cells showed intact
cytoplasm occupied by a small number
of the inclusion bodies. Cell nuclei
showed some deformity. The second
stage revealed highly vesicular
nuclei, enlarged nucleoli, and
cytoplasm filled with inclusion
bodies. The third stage was
characterized by naked nuclei stripped
of cytoplasm. The inclusion bodies had
apparently invaded the walls of small
blood vessels, producing degenerative
changes. The authors studied these
inclusion bodies under a dark phase
microscope and concluded that they
were dealing with a living organism.
They were able to watch these zooid
organisms in various phases of the
life cycle and to keep them alive over
a period of many hours in a suspension
of cortical material. Drawings made
from microscope slides are presented
in the article. They indeed suggest an
ominous picture of parasitic
infestation. Papez and Bateman propose
and argue convincingly for the thesis
that the histopathological changes
reported by other investigators are
associated with various phases in the
life cycle of the organism.

Schaderwald (72) confirmed many
of the findings of the previous
investigators, using material from 34
biopsies taken at the time of
prefrontal lobotomy. He states that
his data is in complete agreement with
that reported by Papez, and Papez
and Bateman. However, because of
the lack of controls, he is much more
cautious about ascribing
pathognomonic significance to the
findings.

The reviewer must agree with
Weinstein (79) in discussing the
studies of Papez, and Papez and
Bateman: “Merely the unusual nature of
the claims must not be allowed to
interfere with open-minded
considerations of the findings ... Many
questions remain which can be
answered only through carefully
designed experimental work with
adequate attention to controlled
observations...” (76, p. 549).

The thalamus and autonomic
nervous system have also been
implicated as sites of possible lesions
in schizophrenia. In 1939, Stein and
Ziegler (75) reported a biometric
analysis of 24 patients and 1 normal
subject. The brains and thalami in the
schizophrenic patients were smaller
and the cell count lower than for
manic-depressive patients in general.
However, none of the differences was
statistically significant. The
investigators felt that the small size
of their sample contributed to the
negative findings.

For a group of 16 paranoid
schizophrenics, Bateman and Papez (7)
found inclusion bodies in thalamic
cells as well as cell loss as high as 90
per cent in the association nuclei. On
this finding they based their
proposition that hallucinations and
delusions are the result of abnormal
nervous discharge of the thalamus to
the cortex. Since sensory input from
peripheral sense organs must pass
through the thalamus, the presence
of diseased cells there could distort
the impulses to such an extent that
the misperceptions characteristic of
psychoses would be expected.

Two reports question the
importance of histopathological
findings in schizophrenia. In 1949,
Rowland and Mettler (69) studied 22
schizophrenics and 1 manic patient.
They hypothesized that if cell loss is
actually an important feature in
psychosis, it should be positively
correlated with severity of illness.
However, their findings indicated no
difference in cell count between a
group hospitalized more than
twenty-two months and a group
hospitalized less than twenty-two
months. They contend that the various
methods of counting cells are
frequently of questionable
reliability, and suggest that this fact
in large measure accounts for the
conflicting reports of cell loss.
The same opinion is voiced by Wolf
and Cowan (81). They add that
perhaps the positive results which have
been reported can be explained on
the basis of misjudgment of the limits
of normal variation in brain tissue,
and on the basis of methodological
artifacts. They feel that available
studies are not sufficient to permit
accepting the thesis of a specific
organic factor in schizophrenia. This
position is vigorously opposed by
Winkelman in a discussion following
their article.

In summary, it can be stated that
a number of different pathologists
report similar results. A frequent
finding is loss of nerve cells.
Pneumoencephalographic reports of
atrophy appear to support this
impression. Other repeated findings
are: the presence of a demyelinizing
process, degenerating nerve cells, and
pathological changes in the various
glial cells. It seems fairly well
established that the brains of some
schizophrenics show structural damage,
but the significance of the damage is
difficult to evaluate because
comparable studies of nonpsychotic
subjects are generally lacking.


BIOCHEMICAL STUDIES 


So far, very few biochemical stud- 
ies of brain function in psychosis 
have been made. In 1930, Freeman 
(33), after agreeing with Dunlap that 
there were no histopathological find- 
ings in schizophrenia, reported a de- 
ficiency of catalytic iron in the brain 
cells of schizophrenics. His sample 
included 35 schizophrenics and 16 
patients of other diagnoses. Freeman 
suggests that the lack of this catalytic 
agent (which is necessary for the use 
of oxygen) may underlie some of the 
symptoms. An additional inference
that could be made from these find- 
ings is that the inability of the cells 
to take up oxygen would eventuate in 
their degeneration. 

Doust (23) used two samples of
schizophrenics, one English and one
American, totaling 87 patients. He
studied the brains by means of the
spectroscope and photoelectric
oximetry, and found marked reduction
in the effective capillary
oxyhemoglobin. In view of the extreme
sensitivity of nervous tissue to anoxia
and the irreversibility of the
consequent damage, Doust's results
seem quite important to the question
of central nervous system involvement
in schizophrenia. Also, it is
interesting to note that this
investigator reports a much more
profound deficiency in capillary
oxyhemoglobin among the so-called
“process” schizophrenics (i.e.,
classical dementia praecox) than
for any other group studied. The
reviewer feels that this finding has
considerable significance for methods
of selecting groups of schizophrenics
for study. This point will be
elaborated in the discussion to follow.


PHYSIOLOGICAL STUDIES 


From England, Hoffer, Osmund,
and Smythies (44) report the
accidental discovery of adrenochrome,
or pink adrenaline. This substance,
closely related to adrenaline, is a
deterioration product of that hormone.
Although chemically it is only slightly
different from adrenaline, its effect
on the body is quite different. It
shows toxic effects in much the same
way as does mescaline. For this
reason, adrenochrome caught the
interest of investigators hypothesizing
that toxicity is a basis for
schizophrenia. (The catatonic subgroup
has usually been the object of such
study, since they present many of the
symptoms generally indicative of toxic
disorders.)

Hoffer, Osmund, and Smythies
studied the effects of pink adrenaline
on a group of volunteer normal
subjects. The results are striking.
When given adrenochrome, their
subjects showed behavioral symptoms of
schizophrenia. Furthermore, their
subjective reports confirmed the
observer's impressions that they were
delusional and experienced
hallucinations. EEG's taken on the
subjects under the influence of the
drug showed arrhythmias and epileptic
patterns within half an hour from
the time of administration. The
authors hypothesize that when
adrenochrome enters the cerebral cells
it inhibits carbohydrate metabolism
and interferes with cell respiration.
In schizophrenics, an excess of
adrenochrome may be produced,
resulting in autointoxication. One
might also speculate about a possible
connection between an excessive
production of adrenochrome and the
hypothesis that schizophrenics suffer
a chronic hyperactivity of the
sympathetico-adrenal system.

The endocrine system has been
considered the site of possible
pathology in schizophrenia for well
over half a century. Every gland from
sex to pituitary has at one time or
another been identified by various
investigators as the locus of the
difficulty. The findings have been
tantalizing but equivocal, and the
work done in this area would
constitute a bibliography several
times larger than the present one.
However, a few of the studies which
explicitly link endocrine and nervous
system function will be mentioned.

Hoskins (46), in reviewing the
literature on endocrine function in
schizophrenics, reports some of the
work done over a period of twenty
years at Worcester. His observations
led to the conclusion that in an
appreciable number of cases
dysfunction is a basic factor, and
schizophrenic patients seem to suffer
from a lack of thyroid secretion.
However, continued treatment with
hormone raised the question
whether the target tissue was not at
fault. That is, since some patients
did not respond even to massive doses
of thyroxin, Hoskins concluded that
the difficulty is not so much in
thyroid gland production itself as it
is in an apparent failure of the
appropriate receiving or “target”
tissue to utilize the hormone.

The other glands which frequently
show anomalous reactions are the
adrenals. The function of the
adrenals is relevant to our subject,
for while there is no direct nervous
control of the adrenal cortex, its
function is intimately related to
that of the autonomic nervous system.
Indirectly, the secretions of the
adrenal cortex are influenced by the
autonomic nervous system through the
hypophysis, which is controlled by
the hypothalamus. In the past few
years there has appeared an
impressive number of studies on
adrenal function in schizophrenia.
No doubt the research on adrenal
activity has been accelerated because
ACTH and cortisone have become
available as research tools.

Hoskins (46) suggests that in
schizophrenia the adrenal-autonomic
system is somehow disturbed,
ultimately expressing itself in the
metabolism. Work on the adrenals and
the emergency functions has been
followed up by Hoagland (17, 39, 40,
41, 42). These studies are typical of
the many reports. The primary finding
appears to be that the schizophrenic
exhibits a physiologically subnormal
response to stress. Altschule,
Promisel, and Parkhurst (3) found
that the reactions of schizophrenics
to ACTH injections are not reduced, so
they propose that the difficulty lies
in the hypothalamus. Perhaps this
structure fails to activate the
pituitary to secrete sufficient
adrenocorticotropic hormone,
resulting in a reduction of the usual
adrenal stress reaction. Hoagland et
al. (39) support this point of view.


After a rather thorough review of
the literature on the bodily functions
in psychoses, Altschule (2) concludes
that most of the disordered
physiology reported is a consequence
of the psychosis rather than an
etiological factor. He makes one
exception, namely that disturbances in
adrenocortical function may cause
metabolic disturbances which
interfere with brain function,
producing psychoses. He emphasizes
that there has been a general tendency
to ignore the physiology of mental
illness. This tendency has been
reinforced by the continued use of the
term “functional,” which implies,
wrongly, a lack of physical or
biological imbalance. Actually, there
is no such thing as a functional
psychosis. Altschule states in regard
to this point: “Theories that ignore
cerebral metabolic processes and
regard psychosis as merely a state of
mind, deliberately chosen by the
patient because of environmental
influences, have nothing, not even
reasonableness, to support them” (2,
p. 212).

Paralleling the endocrine studies is
a series of experiments with various
drugs which produce “model-psychoses”
(32). Studies by DeJong (21)
with mescaline, bulbocapnine, and
other catalepsy-producing drugs led
him to hypothesize a “faulty
detoxification theory” of catatonia.
According to him, the liver is the
organ at fault.

Since DeJong's studies, numerous
other drugs have been used in an
attempt to find a toxic agent
responsible for psychosis. Of these,
one which has recently received
considerable attention is lysergic acid
diethylamide (LSD-25) (22, 32, 43, 47,
52). This drug seems to act on the
hypothalamus and, in normals, produces
many of the psychological effects
characteristic of the schizophrenic
psychoses. The drug has also been
administered to schizophrenics, and
the reported effects vary. In general,
however, the effect seems to be one of
exacerbating existing symptoms. For
normals, these effects are reported:
visual disturbances, thought
disorders, mood changes, paranoid
tendencies, auditory phenomena, etc.
Physiological changes are also noted.
These changes are principally those
associated with autonomic activity.
This leads to the hypothesis that it is
the homeostatic balance which is
disturbed by the drug (67).

On the negative side, Sloane and 
Doust (73) interpret their results as 
indicating that schizophrenics show 
a diminished or unchanged auto- 
nomic reactivity following adminis- 
tration of LSD-25. Their sample in- 
cluded 19 patients and 14 controls. 
In an attempt to clarify the specific 
action of the drug, Mayer-Gross, 
McAdam, and Walker (52) found 
that it produced changes in phos- 
phate metabolism. They propose 
that the apparent increased respira- 
tion of brain tissue may account for 
the psychological disturbances pro- 
duced by the drug. 

It is important to note that the last
studies reviewed are also of
methodological interest, for when a
disease can be artificially produced,
it is an important step in the
direction of control of that disease.
The chemist can understand his compound
completely, not when he has succeeded
in analyzing it but only after he can
synthesize it in his laboratory. He is
then in a position to control it and
predict its action. It is evident, of
course, that the reactions produced by
the administration of a drug to normal
persons cannot at this stage be
unequivocally equated with
schizophrenic psychoses. For that
matter, as is pointed out by Pennes
(64), the reactions to LSD-25 cannot be
entirely due to the specific action of
the drug; otherwise, the wide
variability in its effects would not
have been found. He too found that not
all of his 55 schizophrenic subjects
reacted to the drug in the same way.
Actually, in some cases the drug seemed
to have a temporary normalizing effect.
The most judicious conclusion is
probably that response to the drug is
the result of interaction between
psychological and physiological
factors.

In spite of the large amount of work
done with various drugs, the mechanism
of their action is still not well
understood. Nevertheless, more and more
of them are being discovered and tried
out. Some, as reported by the press,
have assumed the status of wonder
drugs. Reserpine (54) and
chlorpromazine are two such. Again, the
effectiveness of these drugs seems
primarily ascribed to their action on
the autonomic nervous system (36).


PSYCHOLOGICAL STUDIES 


It is rather disconcerting to find that
schizophrenic patients often respond to
psychological tests in ways that we
have learned to consider characteristic
of patients with known organic
involvement. This presents a difficult
problem to the clinician who is asked
to make a differential diagnosis. In
the past, test constructors have been
busy trying to show how well their
tests differentiate the current
nosological groups. Little attention
has been paid to the degree of overlap,
even though the similarities between
two groups on test performance may be
actually more impressive than the
differences. One exception is the
report of Hanfmann and Kasanin (37) in
the discussion of their widely known
test of conceptual thinking. They found
a small but significant mean difference
between groups of known organics and
schizophrenics. Regarding the response
similarity of schizophrenics to
organics, they state, “Cases of this
type often bear a close resemblance to
cases of irreversible brain disease,
except for a greater variability and a
certain imaginative quality of their
productions which are lacking in
organic cases” (37, p. 75). It might be
added that this imaginative quality is
often very difficult to detect.
Altrocchi and Rosenberg (1) also found
overlap of schizophrenics and organics
in their work with a new block test of
conceptual thinking. Some of the
so-called functional psychotics could
not be differentiated from known
organic cases, on the basis of the
test, although others could. The
hypothesis is immediately suggested
that among the heterogeneous group of
mental ills called “schizophrenia,”
there is a considerable percentage
suffering chronic brain damage.

In an attempt to test this hypothesis,
Brackbill and Fine (14) studied three
groups of patients: a group of cases of
known organic damage; a group of cases
of classical or process schizophrenia,
selected on the basis suggested by
Kantor, Wallner, and Winder (49); and a
group of reactive or acute cases. Using
Piotrowski's (65) Rorschach signs of
organic involvement, they found that
the organic and process groups could not
be distinguished from each other, 
while the reactive group showed sig- 
nificantly fewer organic signs than 
did either of the other two groups of 
patients. This study provides addi- 
tional support for the thesis that 
among schizophrenics there is an ap- 
preciable number who suffer central 
nervous system pathology. 

Another approach illustrative of 
the type of contribution which could 
be made by psychologists is that of 
Meadow and Funkenstein (53). They 
classified a group of 58 schizophrenic 
patients into three subgroups on the
basis of autonomic reactivity. Group A
had an autonomic pattern indicative of
some kind of “release” from cortical
inhibition, suggesting a breakdown in
thalamo-cortical or cortico-thalamic
interaction. On tests of abstract
thinking these patients showed the
severe impairment usually associated
with brain damage. The general behavior
of these patients was of the nature
which typically indicates poor
prognosis. On the other extreme were
the symptomatically schizophrenic
patients of group B, who showed no
disturbance in autonomic reactivity and
no disturbance in abstract thinking,
little anxiety, and no organized
delusions. Prognosis was guarded, but
was better than for group A. The C group
showed severe anxiety, a different 
pattern of autonomic balance, and 
the best prognosis. It could be hy- 
pothesized that group A represented 
the classical or typical schizophrenic 
with organic symptoms, while the 
other two groups represented reac- 
tive or situational disturbances. 


HYPOTHESES AND SUGGESTIONS 


The various investigators in this 
area have suggested a number of 
ideas and concepts in regard to the 
meaningfulness of these findings. 
Eickhoff (26) and Anderson (4) pro- 
pose an organic factor in childhood 
schizophrenia. The former sum- 
marizes her observations as follows: 

I am therefore, postulating that
schizophrenia in childhood is an arrest
in the development of abstract thought
and emotional maturity at an infant or
toddler level; that this arrest is
dependent basically upon a defect in
the acquisition of general sensation;
and this is due either to a defect in
the neurological systems concerned with
pain, touch, temperature, position, and
vibration sense or to faulty
stimulation from the outside or both;
and this defect leads to a delay in the
formation of the body image and other
images (26, p. 234).


Much the same statement is made 
by Anderson as the result of her ob- 
servation of schizophrenic children: 

This outstanding modification in my
conceptual thinking concerning
schizophrenia would be in seeing the
development of these reactions as being
determined not by rather ambiguous
rejection behavior on the part of the
mother, or any other significant
person, but as a failure in
interpersonal relations brought about
primarily by a very specific type of
organic brain deficit in the child. The
exact nature of the physiologic
pathology would probably be in the
associational pathways of the most
superficial layers of the cortex. The
deficit might be of any degree from
minimal to extensive (4, p. 37).


Lauretta Bender's (10, 11) long 
years of work with schizophrenic 
children have led her to a biological
conception of schizophrenia. At 
times she speaks of it as an organic 
brain disease, proposing that there 
is an inherited weakness in the home- 
ostatic mechanism. This weakness in 
autonomic function is such that the 
usual crises of physiological develop- 
ment may be sufficient to precipitate 
the illness. Follow-up studies of 
schizophrenic children showed that 
the symptoms closest to the biological 
were the best predictors of adult 
schizophrenia. This again bolsters 
the concept of a process schizophrenia,
which starts early, is malignant, and
is nonremitting.

All three of these investigators see
evidence of brain damage even in these
early cases of schizophrenia. It is
most interesting that the defect is
suspected to be somewhere on the input
side of the nervous system, for Bateman
and Papez (7) also suggest a distortion
on the input side, resulting from cell
disease in the thalamus. Also,
according to Nielsen (56), there is a
kind of schizophrenia that is based on
a diencephalic lesion of a specific
nature. He believes that the disease
may eventually reach the cortex late in
the process, and proposes that the
defect is probably inborn, producing a
disorganization of neuronal patterns.
He proposes to call this an “apsychotic
schizophrenia.” Patients of this group
are also called “ambulatory
schizophrenics.” They are nomadic, make
a borderline schizoid adjustment, and
their contacts with hospital and clinic
are unproductive of any change in the
pattern of adjustment.

Bychowski (15, 16) speaks of
schizophrenic thinking as similar to
the kind of organic reactions described
by Goldstein. After pointing out the
concrete character and the similarity
of thought processes for the two
groups, he arrives at the hypothesis
that there is a dynamic deficiency of
cortical and subcortical organization
in schizophrenia. He proposes that a
certain group of schizophrenics suffer
an agnosia and apraxia of thought based
on cerebral deficit.

Bellak (8, 9), after reviewing some
3,500 papers on schizophrenia, has
formulated what he calls a “multiple
factor theory.” He maintains that the
illness can be the result of a
multiplicity of etiological factors. In
one case it might be the result of
liver pathology; in another, organic
brain disease; etc. The symptomatology
of schizophrenia may be a sort of
“final common pathway” for the
expression of illnesses produced by
many different causes. This formulation
is apparently an attempt to encompass
the divergent and confusing findings in
schizophrenia. In such a system, one
could think of the behavioral
symptomatology as the phenotype which
is the expression of a number of
different genotypical factors.

It is the reviewer's impression that
the findings reported by these
investigators merit more consideration
from psychologists than they have so
far received. It is probably true that
the resolution of the question of an
organic basis for schizophrenia rests
with neuropathologists. There is,
however, room for psychologists to make
a contribution. It is widely accepted
that the group now labeled
“schizophrenic” is a heterogeneous one,
often with little similarity among its
members. One of the reasons for such
conflicting research findings could
well be this heterogeneity. That is, it
might be hypothesized that some of the
variability noted results from using
samples of patients actually
representing different populations.
Therefore, it appears that one of the
first steps in clarifying the problems
of research in schizophrenia is to work
out a more effective classification
scheme than the four subgroup system
now used. It is the reviewer's
impression that the evidence is
sufficient to postulate the existence
of organic brain disease in some, but
not all, schizophrenic patients. Those
who show the classical signs of formal
thinking disorder, such as
concreteness, disconnected
associations, etc., may have some kind
of structural brain damage. On the
other hand, those who show delusions,
hallucinations, etc., but no thought
disorder, may not have central nervous
system pathology. Perhaps it is too
early to make a definite statement
regarding the exact nature and
mechanisms of brain dysfunction, either
etiologically or dynamically, but
certainly there are a number of
available theories of brain function
which could be applied to test
current hypotheses and to suggest
further ones.

In recent years a number of articles
have been written on the use of role
playing in education, personality
assessment, training, and
psychotherapy. Yet, in spite of the
widespread popularity of role playing,
comparatively few studies have been
reported in which role playing was
evaluated experimentally.

The present article reviews such
experimental studies in an attempt to
establish the degree to which present
practice is justified by experimental
evidence, and to indicate the problems
toward which future experimentation
might most profitably be directed.

A role-playing situation is here
defined as a situation in which an
individual is explicitly asked to take
a role not normally his own, or if his
own in a setting not normal for the
enactment of the role.

Examples of taking a role not one's own
are typically found in assessment
procedures. For example:

You are a twelve-year-old [...]
brothers and sisters. All year long you
have been looking forward to having
your first experience in boy scout camp
which is for two months this summer.
Camp begins next week. You are starting
to pack as your father enters the room
(7, p. 374).

Taking one’s own role in an unusual
setting typically occurs in the context
of a procedure designed to produce
personality change. In such situations
the individual is asked to portray
scenes from his own life before a group
of other people. The individual is
acting his own role but the setting is
unusual. For example:

We would like to see how you normally
handle a customer when he comes in your
store. Suppose Mr. X plays one of your
customers. He has just asked you the
price of a certain article. What do you
do?

Studies of role playing can
conveniently be grouped in accordance
with the two definitions considered
above. That is, they can be considered
to treat role playing either as a
method for assessing personality or as
a method for producing personality and
behavioral changes. This classification
will be adopted in the review which
follows.2

1 I am indebted to Dr. Edgar Borgatta
and Dr. Leonard Cottrell of the Russell
Sage Foundation for reading this
manuscript and providing many helpful
suggestions.


ROLE PLAYING AS AN 
ASSESSMENT PROCEDURE 


Early work in this area was strictly
qualitative. In 1945, for example,
Moreno and Moreno (17) used role
playing to study the social stereotypes
of children. Other examples, summarized
by Eaton (9), occurred during World War
II. At this time use was made of role
playing in the assessment and selection
of personnel. The studies of the OSS
(18) and the proposed standardized
assessment procedures of Bronfenbrenner
and Newcomb (7) are outstanding
examples of the qualitative work done
during this period. Interest in
quantitative measurement did not occur
in this area until experimenters became
concerned with the reliability and
validity of their measuring
instruments.

2 There have been a number of studies
in which the experimenters have
consciously played a role as part of
the experimental procedure. These
studies are not included in this review
for two reasons. First, the
experimental subjects themselves do not
role-play. Second, role playing is used
strictly as an experimental technique
for fabricating reality, and is not
itself the object of study.


Reliability 


Probably the first to test the relia- 
bility of ratings of role-playing per- 
formance were Rotter and Wickens 
(20). They asked observers to rate 
subjects placed in two standard role- 
playing situations on the single char- 
acteristic of “social aggressiveness.” 
During one situation the subject was 
observed by a panel of four observers. 
During the second situation he was 
observed by a different panel of ob- 
servers. There were also three ob- 
servers who watched his behavior in 
both situations. Under these condi- 
tions a mean interrater reliability 
coefficient of .71 was obtained for a 
panel observing a single situation 
(N of 48). The correlation between 
the mean ratings of the members of 
the two panels on the behavior of the 
48 subjects over two situations was 
.55, while the mean ratings of the 
three observers who watched the sub- 
jects in both situations had a reli- 
ability coefficient of .77. The .55 
correlation may be interpreted not 
only as rater reliability but as subject 
consistency from situation to
situation. The larger .77 correlation may
result in part from a halo effect. 
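
A mean interrater reliability coefficient of the kind reported by Rotter and Wickens can be illustrated as the average of the pairwise Pearson correlations among the raters on a panel. The sketch below rests on that assumption, since the text does not say how the mean was formed; the ratings, the four-rater panel, and the function names `pearson_r` and `mean_interrater_r` are hypothetical and are not taken from the study.

```python
from itertools import combinations
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def mean_interrater_r(ratings_by_rater):
    """Average the correlations over every pair of raters on the panel."""
    pairs = list(combinations(ratings_by_rater, 2))
    return sum(pearson_r(a, b) for a, b in pairs) / len(pairs)


# Hypothetical "social aggressiveness" ratings of six subjects
# by four raters (illustrative values only, not data from the study).
panel = [
    [3, 5, 2, 4, 1, 5],
    [2, 5, 3, 4, 1, 4],
    [3, 4, 2, 5, 2, 5],
    [4, 5, 1, 3, 1, 4],
]
print(round(mean_interrater_r(panel), 2))
```

With real data, each inner list would hold one observer's ratings of the same subjects in the same order.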


Stanton and Litwak (22), in a study
devoted to evolving a test of autonomy,
asked five judges to record the number
of times they heard examples of
“nonideal” behavior on 11 tape
recordings of role-playing
situations.3 In this way they obtained
a mean reliability coefficient of .90
(rho).

Moldawsky (16), also using tape
recordings of role-playing situations,
was able to obtain an interrater
reliability coefficient of .89 for
ratings of “rigidity” of behavior.

3 Stanton and Litwak equate nonideal
behavior with lack of autonomy. This
may be a dubious equation, but it does
not affect the present discussion.

Kelly and Fiske (13), interested in
predicting success in clinical
psychology, asked judges to rate
subjects in role-playing situations on
characteristics which were unrelated to
the immediate role-playing enactment.
Among these characteristics were
research ability, therapeutic ability,
diagnostic ability, and over-all
suitability for the profession of
clinical psychology. Interjudge
reliability for these characteristics
was found to be .45, .44, .40, and .51
respectively. In this study, however,
not only was the reliability of the
observer's judgment being measured but
also the degree of similarity between
his conception of the abilities
required for successful performance in
research, therapy, and diagnosis and
that of the other judges. In the single
case in which the individual judges’
ratings were averaged and corrected by
the Spearman-Brown formula, the
reliability for the given variable rose
from .51 to .76.

Borgatta (2) has studied the
reliability of “real” situations and
role-playing situations. In his study
the observation system (Bales
Interaction Analysis) was held constant
and the reliability of the categories
in the system was measured within the
types of situations. It was found that
the kinds of behavior which an
individual initiates and receives are
about as consistent in role playing as
in “actual” behavior.
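
The Spearman-Brown step reported for Kelly and Fiske (13), a rise from .51 to .76, can be checked with a short calculation. The sketch below uses the standard Spearman-Brown prophecy formula; the function name `spearman_brown` is illustrative, and the number of judges is an assumption, since the text does not state it. Three judges is simply the value of k that reproduces the reported figures.

```python
def spearman_brown(r_single, k):
    """Predicted reliability of the mean of k judges.

    r_single is the reliability of a single judge's rating;
    k is the (assumed) number of judges whose ratings are averaged.
    """
    return k * r_single / (1 + (k - 1) * r_single)


# Single-judge reliability of .51 for over-all suitability, as reported.
for k in (2, 3, 4):
    print(k, round(spearman_brown(0.51, k), 2))
# 2 0.68
# 3 0.76
# 4 0.81
```

Under this assumption the corrected coefficient of .76 is what the formula predicts for the average of three judges.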

Almost any system for observing 
human interaction which has been 
devised can be used for observing 
role playing. The reliability asso- 
ciated with such systems is therefore 
related to the present discussion, but 
here we report only the findings
derived from interaction situations
specifically labeled role playing. In
summary, it can be said that the above
studies tend to confirm the possibility
of establishing reliable systems of
observing role-playing behavior.


Studies of validity can be ordered in
relation to the nearness of the testing
situation to the validating situation.
The following discussion is ordered
according to this principle, starting
with studies in which the two are
adjacent and extending to studies in
which the testing and validating
situations are only tenuously related.

Borgatta (2, 3, 4, 5, 6) in a series of
reports has described the results of a
study in which subjects alternately
role-played and “actually” behaved in
an experimental setting. The subjects
of this experiment were assigned to 166
three-man groups and rotated in such a
way as to ensure that each subject
would have equal experience with an
array of persons.

Perhaps the most striking finding of
this study in relation to the present
context was that, though the
interaction profiles varied greatly
between the role-playing and actual
situation, the performance of group
members remained remarkably constant in
relation to each other. Specifically
the correlations between total rate of
interaction initiated and received and
rate of positive emotional responses
initiated and received over the two
situations were .76, .66, .58, and .58
respectively. (A correlation of .17 was
significant at the .05 level.) In
addition, a close parallelism was found
between role playing and actual
behavior over all the Bales Interaction
Categories except that of “asking for
suggestions,” which was a null variable
for the actual behavior.
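
The remark that a correlation of .17 reached the .05 level can be related to sample size with a rough calculation. The sketch below is an approximation under stated assumptions: it uses the Fisher z transformation with a two-tailed test, neither of which is specified in the text, and the sample sizes tried are hypothetical because the number of subjects is not given here. The function name `critical_r` is illustrative.

```python
from math import sqrt, tanh
from statistics import NormalDist


def critical_r(n, alpha=0.05):
    """Approximate smallest |r| significant at level alpha (two-tailed).

    Relies on the Fisher z approximation: when the true correlation is
    zero, atanh(r) is roughly normal with standard error 1 / sqrt(n - 3).
    """
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    return tanh(z_crit / sqrt(n - 3))


# Hypothetical sample sizes; the study's N is not stated in this passage.
for n in (50, 100, 135, 200):
    print(n, round(critical_r(n), 3))
```

Under these assumptions a critical value near .17 corresponds to a sample on the order of 135 subjects; a one-tailed test or an exact t-based computation would give a somewhat different figure.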

Certain differences between role
playing and actual behavior were noted,
however. Role playing was characterized
by lack of tension and inhibition, and
by higher rates of asking for and
giving opinion in comparison to the
actual behavior in which more emphasis
was placed on neutrality of feeling and
task orientation. It is also
interesting to note that leadership
self-ratings made by subjects, and
leadership ratings made by their
associates appeared to correlate more
highly with their performances in role
playing than with their performances in
the actual behavior situation.

In a further analysis of these data
Borgatta (6) compared role playing and
actual behavior with behavior on a
paper-and-pencil projective test
(Conversation Study). The test
consisted of a leaflet in which ten
plates with various three-man
situations were drawn. The subject was
asked to write down what each of the
men was saying. This imagined
conversation was then analyzed by the
Bales Interaction Categories in the
same manner as the role playing and the
actual behavior. From these data a
correlation of .3 was obtained between
the total rate of response on the
projective test and that of either the
role playing or the actual behavior.
Role playing and actual behavior, as
previously mentioned, were highly
correlated.

This finding was similar to one
obtained in an earlier experiment by
Borgatta (1). In that pilot experiment
the comparison was made between a
subject’s response to the Rosenzweig
P-F Study when given normally (in
written form), when used as the basis
for a series of role-playing incidents,
and as the basis for an analogous
series of apparently unplanned
real-life incidents. In this earlier
experiment, in which 78 subjects were
utilized, the findings were not clear
because of the low reliability
associated with the scoring system
which distributed 24 scores over 11
categories.

To clarify further the relationship 
among these three types of situations 
Borgatta (6) subjected his data from 
the later study to a factor analysis. 
Of five relevant factors found, two, 
task ability and emotional assertive- 
ness, showed close parallelism over 
role playing and actual behavior and 
some parallelism over the projective 
test. A third factor, military adjust- 
ment, showed some parallelism over 
role playing and actual behavior, 
with a suggestion of parallelism over 
the projective test. In a fourth factor, 
task supportiveness, parallelism was 
found over role playing and actual 
situation only. Finally in a fifth 
factor, emotional group supportiveness, 
no parallelism was found. 

In his work Borgatta measured 
actual and role-playing behavior 
which occurred in the same experi- 
mental setting. Stanton and Litwak 
(22) expanded the distance between 
these types of behaviors by studying 
the relationship between role playing 
occurring in an experimental setting 
and actual behavior occurring in a 
natural setting. They were interested 
in studying autonomy as an aspect 
of stress-tolerance. They used a test 
which consisted of three standard 
stress situations. Raters were asked 
to note the number of nonideal be- 
haviors which occurred during the 
enactment. The validity of the test 
was determined by correlating the 
measure of autonomy so obtained us- 
ing a group of foster parents as sub- 
jects with ratings of autonomy made 
by social caseworkers who had con- 
siderable contact with the foster par- 
ents. A correlation of .82 was ob- 
tained. A correlation of only .48 was 
found between the caseworker’s rating
and the rating of the homefinder,
who interviewed the foster parents
in their home. In a small class of 
eight students, a correlation of .79 
was found between ratings of auton- 
omy determined from role playing 
and ratings made by close friends. 

It is also interesting to note that 
while raters who saw only role-play- 
ing scenes referring to autonomy were 
able to make ratings that correlated 
.82 with caseworkers’ ratings, raters 
who saw other role-playing scenes 
in addition, were able to make ratings
which correlated only .40 with
caseworkers’ ratings. This suggests 
that the additional information which 
raters received contaminated, rather 
than clarified, the rater’s perception 
of the role-playing situation. 

Kelly and Fiske (13) attempted to 
use role-playing behavior as a basis 
for making predictions of future suc- 
cess in clinical psychology. In their 
large-scale assessment program they 
utilized a battery consisting of stand- 
ardized paper-and-pencil tests, pro- 
jective tests, interviews, written back- 
ground material, and _ situational 
(role-playing) tests. Two kinds of 
validation are provided by these 
data. First, one can consider the 
correlation between the situational 
test rating and the over-all rating ob- 
tained by the use of the whole test 
battery which, in this case, is the 
validating criterion. It was found 
that observers seeing only the situa- 
tional tests could make ratings as 
good as those made by other observers
who read the subject’s background
credentials and conducted a preliminary
interview with him, or who had seen his
background credentials, his objective
and projective test results, and had
read his autobiography. Second, one can
consider the correlation between the
situational test rating and the
criteria utilized to establish later
successful performance in clinical
psychology.
The correlation between the over-all
score based on the test battery and 
the validation criterion was .34. The 
correlation between the situational 
tests and the validating criterion was 
.20. This is not impressive since an
equivalent correlation was obtained by
using either the credential file or a
standard intelligence test score. This
low correlation may reflect the fact
that the role-playing situations used
were unrelated to the validating
criterion in content.

In summary, it can be said that there
is some evidence which indicates that
valid predictions of interpersonal
behavior can be made from role-playing
assessment procedures. The actual
development of such procedures has, to
date, been very slow and consequently
the problems involved have hardly been
explored.


ROLE PLAYING AS A METHOD FOR
PRODUCING PERSONALITY CHANGE

Role playing has been used to produce
personality and behavioral change in a
wide variety of settings varying from
leadership and teacher training to
psychotherapy with neurotics and
psychotics. Comparatively little
experimental work has been reported,
though application has been extensive.

Harrow (10) studied the effect of role
playing on 20 schizophrenics organized
into two groups that met 25 times over
a two-month period. A third group of
schizophrenics was used as a control.
Each individual was given the
Rorschach, the MAPS, and a special role
test before and after the role-playing
series. Although statistically
significant results were not obtained,
a group of judges looking at the
before-after data found evidence of the
development of more realistic
perception of and interest in the
outside world as well as an increased
ability to handle personal problems.

Jones and Peters (12) in a similar 
experiment studied the effect of role 
playing on a group of twelve schizo- 
phrenics and a group of eight lobot- 
omy patients. Another group of 
schizophrenics was used as a control. 
The role-playing sessions were held 
once a week for four months. Before- 
after evaluation was made using the 
Porteus Maze, the Mirror-Tracing 
Test, Gardner Behavior Chart, Ror- 
schach, and the Draw-A-Man Test. 
Again, although statistically signifi- 
cant results were not obtained, it was 
found that all tests except the Draw- 
A-Man Test indicated improvement. 

Sause (21) has investigated the effect
of role playing on a group of 15 normal
student teachers. At the beginning of
each role-playing session the group
members selected a problem related to
student-teacher interaction as the
focus of the session. They then each
wrote down their solution to the
problem. Following this they
role-played the problem and again wrote
down their solution. Finally they rated
their satisfaction with the meeting.
Four judges rating the written
solutions before and after a given
role-playing session were practically
unanimous in finding an improvement in
the solutions after role playing. The
judges agreed with each other 80 per
cent of the time. It was also found
that the quality of the solutions
reached before role playing improved as
the sessions progressed. Eighty-seven
per cent of the student teachers were
rated as having better solutions before
role playing at the twelfth meeting
than at the third, an indication that a
transfer of training may have occurred.

Maier (15), also working with normal
groups, has tested the effect of a
specialized form of role playing
(Multiple Role Playing) in helping
groups reach satisfactory solutions to
common problems. Using this technique
he found that 42 out of 44 groups
reached a solution, and that only four
per cent of the group members were
dissatisfied with the solution reached.

Because of either small sample size,
design, or lack of underlying
relationship, none of the four
preceding studies provide statistically
significant results. In addition, two
of them fail to utilize a control
group. It is best, therefore, to
consider them as exploratory in nature.

A more satisfactory study from the
methodological viewpoint was that of
Janis and King (11). In their design
they utilized 90 college students
divided into groups of three. Each
student gave a short talk to the other
two on an assigned topic. Each student,
therefore, spoke once and listened
twice. The topics were assigned in such
a way that the student speaking
presented a point of view contrary to
his own as determined by a previous
opinion questionnaire. At the end of
each speaking session (i.e.,
role-playing session) the opinions of
the listeners and the speaker were
reassessed.

The results indicated that, for two of
the speaking topics used, there was
significantly more change of attitude
in the speaker than in the listener.
For the third topic the change was not
significant, but the speakers expressed
greater confidence in the sureness of
their opinions than the listeners. It
was also found that individuals who
improvised most and who were satisfied
with their performance as speakers
underwent the greatest amount of
opinion change. There are two
difficulties in generalizing the
results of this study. First, speaking
on an assigned topic before two other
people is a very limited form of role
playing. It involved only a one-way
interaction between the active speaker
and the passive audience. The typical
role-playing situation consists of at
least two-way active interaction.
Second, the opinions changed were
unimportant. They dealt with the number
of movie theaters that would exist in
three years, the size of the probable
meat supply in the following year, and
the length of time that would be
required to find a cure for the common
cold. It is probable that none of these
topics was of central importance to the
students tested. If this had not been
the case the students might have shown
greater resistance to change.

The results obtained by Janis and King
were replicated by Rosenberg (19) under
less artificial conditions. Using three
standard role-playing situations with
three groups of 15 persons each, group
members were rotated among three
experimental positions: role players,
identifiers (instructed to identify
with the role players), and active
observers (instructed to watch the
action as objectively as possible). A
special group of control observers was
assigned simply to watch the action.
The role-playing situation involved
such topics as labor-management
relations, guidance problems, etc.
After each role-playing session a
questionnaire was given to all group
members. This was followed by a
carefully observed nondirective group
discussion about the role playing.
Using data so gathered it was found
that the role players were most
susceptible to change. They were also
found to be the most productive in the
discussion after the role playing, to
have certain emotional biases of
judgment and perception, and a tendency
to support their own actions in the
role playing. The identifiers were
found to be most critical, almost as
productive as the role players during
the discussion, and to have an
awareness of the dynamics of the
interaction process second only to the
role player. The active observer was
found to be uninvolved and therefore
unemotional with reference to the
enactment. He was as critical as the
identifier but relatively unaware of
the dynamics of the interaction
process. Finally, the control observer
was found to be indifferent,
unproductive during the discussion, and
relatively unaware of the interaction
dynamics. An interesting implication of
this study is that understanding of the
dynamics of interaction processes is
associated with involvement in those
processes. It is also interesting to
note that individual involvement was
found to be a function of position
assignment.

While Rosenberg was concerned with the
differences induced by experimental
position within the role-playing
situation, she did not consider
differences between individuals in the
same position. These differences have
been studied by Luszki (14) who
compared observers who were good at
understanding the interaction dynamics
in a role-playing scene with those who
were not. She found that good
observers, as distinguished from poor
ones, (a) were good at determining how
others looked at themselves, (b) were
good at determining how others looked
at them, (c) were well adjusted, (d)
evaluated themselves as others
evaluated them, and (e) were
consistently and favorably perceived by
others. Further it was found that
persons who were good judges of how
others felt about them were able to
identify most successfully with role
players. Persons who were good judges
of how others felt about themselves
were found to be good at observing the
actions of role players critically.

Brown (8) has explored the effect of
the role-playing situation on the
success of the role player's
performance. He did this by asking his
subjects to reverse roles with each
other and having them evaluate how well
the other person succeeded in
portraying them. It was found that
success was inversely related to the
realism of the role-playing situation.
The more unusual the situation in which
the role was portrayed, the better was
the portrayal.

In summary, the findings in this area
are sketchy and essentially suggestive
in nature. The evidence indicates that
role playing may produce behavioral and
personality change, but this has
certainly not been established with any
degree of confidence. The evidence also
suggests that the effect of role
playing on the individual is related to
his personality and to his position in
the role-playing situation. In
addition, the success of the role
player may be related to the setting of
action in the role-playing situation.


SUMMARY 


Perhaps the most striking impression to
be gained from a review of the
experimental studies of role playing is
their scarcity. Except for the few
studies reported here, effort among
practitioners of role playing seems to
have been devoted to the exploitation
of its various applications.

With reference to personality as- 
sessment, there is some sound evi- 
dence for believing that reliable and 
valid role-playing tests can be de-
veloped. Such tests are, however,
still in their infancy, and it is toward 
the development of these tests that 
future effort might profitably be 
directed. 

With reference to personality 
change, it can only be said that there 
is as yet little supportive evidence. 
The studies which do exist suggest 
the possibility of such change and in- 
dicate a few of the relevant variables
that may be involved. Much effort is
needed in the exploration of role
playing as a method for producing
change. If the possibility of such
change were established a number of
related topics would become pertinent.
For example: What personality
characteristics are most affected by
role playing? Are all role-playing
techniques equally effective? Is role play-

Despite the somewhat pessimistically
worded summary in our last review of
the W-B (97), the avalanche of research
studies with this instrument has abated
very little. [...] attests to that.
This fact is a testament to the
vitality of the instrument and the
tenacity of its numerous users,
especially as a medium of research.
Studies, again, cover the entire range
of interests, with a weakening in [...]
areas [...].

[...] by Watson (127) appeared in the
interval since our last review. Our
attempt will, again, be to make an
exhaustive and critical examination of
the studies in this [...]. Many of
these would probably not have been
published had authors and editors
heeded the recommendations of our
previous review. We have taken care to
prepare a complete bibliography so that
taken with the earlier two reviews
(96, 97) the literature should be [...].

The organization of the present review
resembles closely that of the [...].
[...] first half deals [...] as a test
of general [...], whereas the second
half is devoted [...] modifications of
[...].


THE WECHSLER-BELLEVUE AS A
TEST OF GENERAL INTELLIGENCE

Studies of Reliability

Recent interest in practice effects and
the dependability of subtest scores has
filled the previously observed lacunae
of reliability [...] the W-B. Most of
[...] reviewed studies present
reliability coefficients for the
subtests as [...] for the three IQ
scales.

[...] respectively. The two-letter
combinations PA, PC, OA, BD, and DS
correspond to the following subtests:
Picture Arrangement, Picture
Completion, Object Assembly, Block
Design, and Digit Symbol. FS, V IQ, and
P IQ stand for Full Scale, Verbal IQ,
and Performance IQ, respectively.

Test-retest agreement. The two studies
by Steisel (117, 118) investigated the
effect of practice on the W-B scores.
Subjects were 17 matched pairs of
college girls of superior intelligence.
The first of each pair was retested
after 14 days while the retest interval
for the second of each pair was 77
days. Practice effects on all three
scales appeared for both groups. Mean
increase in FS IQ was eight to nine
points. The P IQ increases were
approximately double those for the
V IQ. Steisel (117) points out that
". . . in a retest situation the
examiner should place more trust in the
verbal scores. . . ." Differences in
the retest scores after the short and
long intervals revealed that only the
score on the A subtest was a function
of the recency of administration. The
author concludes, "The findings in
general corroborate those of previous
studies in indicating that the
significant gains in retest scores are
maintained up to approximately three
months. . . ." A similar study, but
with mental defectives, is reported by
Hays and Schneider (56). They employed
different retest periods of two, four,
six, and eight weeks. The results also
suggest that the different retest
intervals do not affect the IQ
increases differentially. The W-B
Form I FS IQ showed an increase of 7.6
points due to practice as compared with
only 4.4 for the Form II.

Split-half reliability. When Webb and
De Haan (129) reported higher
split-half reliability coefficients for
psychotics than for normals, they
provoked much thought. Helmick (57)
criticizes their failure to equalize
the range of talent for the two groups.
Webb (128) confidently replies that
Helmick's suggestion of equalizing the
ranges of talent would create a hybrid
sample which was unreal and could never
be representative of an actual
population of people.

Perhaps less interesting but more
important than these asides are the
main features of the study. Webb and
De Haan (129) employed 100 subjects to
provide reliability coefficients for
the subtests. All the subtest
coefficients were statistically
significantly different from zero for
the paranoid subjects, while only PA
failed to show significance with the
normal half of the sample. The V
subtest was the most reliable for both
samples. Only two subtests showed much
difference in reliability for the
different subject sampling. These two
subtests, PA and PC, were more reliable
for the psychotics. Botwinick (14)
utilized Webb and De Haan's normal
group, a group of 50 older normals, and
a group of 31 patients with mental
disorders of the senium. Unfortunately
sex could not be held constant for all
three groups. Corrected split-half
coefficients were calculated, and only
C failed to differ significantly from
zero. The author's findings were
characteristic for this type of study.
The coefficients were larger for the
older subjects and were still greater
for the senile disordered group. As the
subtest variability increases due to
aging or mental changes associated with
illness, the reliability coefficients
seem to increase in magnitude. The
reliability of some subtests is not as
high as might be desired. Thus, it is
not surprising that the authors
(14, 129) are skeptical of employing
diagnostic pattern analyses that are
based upon the more unreliable of the
subtests.
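
A "corrected" split-half coefficient is conventionally obtained by
correlating scores on two halves of a subtest and stepping the result
up with the Spearman-Brown formula, r_full = 2r / (1 + r). The studies
reviewed above do not spell out their computations, so the following
Python sketch only illustrates that standard procedure. The function
names and the odd-even item split are assumptions made for the example.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def spearman_brown(r_half):
    """Step a half-test correlation up to full-test length."""
    return 2.0 * r_half / (1.0 + r_half)


def split_half_reliability(item_scores):
    """Corrected split-half reliability from per-subject item scores.

    item_scores: one list of item scores per subject (hypothetical layout).
    Odd-numbered items (positions 1, 3, 5, ...) form one half and
    even-numbered items the other.
    """
    odd_totals = [sum(items[0::2]) for items in item_scores]
    even_totals = [sum(items[1::2]) for items in item_scores]
    return spearman_brown(pearson_r(odd_totals, even_totals))
```

Because the half-test correlation depends on the spread of scores in
the sample, a more variable group will tend to yield a larger
coefficient from the same subtest, which is the pattern the reviewed
studies describe.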

Summary. The additional research
covered in the current section provides
sufficient information to guide the
clinician. The studies have been well
designed and are fairly conclusive. The
IQ scales seem reliable, although
subject to practice effects. The
practice-effect increment does not
depend greatly upon the size of the
test-retest interval provided the
retesting occurs within three months.
Practice effects are about twice as
great on the performance section.
Split-half reliability coefficients for
the various subtests show quite a range
of values. Some of the smaller
coefficients cast doubt upon the
usefulness of diagnostic pattern
analysis which is based upon the more
unreliable subtests. Additional work in
the area of reliability should be well
designed and comprehensive.

Correlation With Other Tests

Other Wechsler scales. The W-B is
compared with the Form II in a
comprehensive study by Gerboth (45).
Retest results disclosed that when
Form II was administered first there
was a significant increase in the
subsequent W-B scales scores. The
scores were uniformly higher by about
five points for all scales.
Surprisingly, Hays and Schneider (56)
found this same elevation of W-B scores
when they preadministered the Form II.
This is surprising, because instead of
normal superiors they employed mental
defectives. For some reason
preadministering the Form I does not
elevate the subsequently obtained
Form II scores nearly as much. A
satisfactory explanation of this
special effect has not been
forthcoming. Returning to Gerboth's
study (45), the intercorrelations
between the subtests suggest that the
I, C, A, and PA subtests were least
comparable between the W-B and the
Form II. The correlation between FS
IQ's was about .[...]5. This is fairly
high since the range of talent was only
half that for an unrestricted sample.
Another Wechsler scale, the Wechsler
Intelligence Scale for Children (WISC),
was compared with the W-B by Delattre
and Cole (31). With a good range of
intelligence they found a correlation
of .87 between the FS IQ's. The mean FS
IQ was six points higher for the WISC.
Intercorrelation of corresponding
subtests of the two tests provides
indices of comparability. PA was
particularly low with an r of only .19
between the two tests. The rather
moderate correlations provoke the
authors to wisely invoke caution in
applying W-B patterns and signs to WISC
data. An almost identical design with a
narrower range of intelligence is found
in a more recent study by Knopf,
Murfett, and Milstein (68). Their
results provide good confirmation of
the earlier findings of Delattre and
Cole (31). They also emphasize the lack
of comparability between corresponding
subtests of the two tests. Vanderhost,
Sloan, and Bensberg (124) report on the
gross comparability of the W-B and the
WISC for a mental defective group.
Their experimental design neatly
counterbalances extraneous factors.
With a restricted range of IQ they
report a correlation of .72 between FS
IQ's. They found the mean WISC to be
1.6 IQ points higher than the W-B, a
difference not statistically
significant.



Other intelligence tests. Alderdice and
Butler (3) found a correlation of .69
between the W-B FS and the Revised
Stanford Binet (S-B), Form M. However,
the correlation is somewhat attenuated
by the narrow range of IQ's in the
sample. Correlations between the W-B
and the S-B are considerably higher
when an appropriate range of talent is
employed (96, 97). As encountered in
the past, the W-B IQ was found to be
somewhat higher than that for the
S-B (3). Duncan (37) presents a table
for converting W-B vocabulary scores to
S-B vocabulary scores, thereby
eliminating the necessity of
administering both vocabularies when
both tests are included in a battery.

The Army General Classification Test
(AGCT) was compared with the W-B by
Tamminen (121). A correlation of .91
was obtained between the two tests when
the sample was corrected for range of
talent. The AGCT appears to be more
closely related to the V IQ than to the
P IQ of the W-B. The American Council
on Education Test (ACE) correlated .61
with the W-B FS with a restricted range
of IQ's in the sampling by Merrill and
Heathers (86). Gerboth (45) reports a
comparable correlation between the ACE
and the W-B but also employed a narrow
range of talent. A thesis by Smith
(109) reports the relationship between
the SRA Primary Mental Abilities Test
and the W-B but was not available for
review.

Knott et al. (69) compared the W-B with
the seven-test Kent Battery. The
correlation with the FS IQ was .87, but
this is difficult to interpret since
the range of talent of the sample was
not reported. Delp (32) compared only
the EGY of the Kent Battery with the
W-B. The correlation was .65, and the
range of talent was only slightly
reduced. Raven's Progressive Matrices
correlated .55 with a short form of the
W-B according to Levine and Iscoe (71).
They state, "The Progressive Matrices
appeared to tap areas of intelligence
most closely related to BD and not
significantly related to C." In another
study Levine and Iscoe (72) found the
correlation between the W-B and the
Matrices ". . . not high enough for
individual predict[ion] . . ." but this
was with a deaf sample. Desai (33)
compared the Matrices against just the
[P] IQ of the W-B. He obtained an r of
.65 which was corrected for
attenuation. Allen, Thornton, and
Stenger (5) found the surprisingly high
correlation of .86 between the Ammons
Picture Vocabulary and the W-B. Rubin
(101) presents the results of his
comparison of the W-B and quantitative
features of the H-T-P. With an
appropriate range of talent he obtained
a correlation of .67. The mean IQ's
were rather comparable.
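
A correlation "corrected for attenuation," such as the one reported
for the Matrices comparison above, is usually computed by dividing the
observed correlation by the square root of the product of the two
tests' reliabilities. The review does not give the reliabilities or
the exact procedure used, so the Python sketch below shows only the
classical formula. The function name and the example values in the
usage note are hypothetical.

```python
import math


def correct_for_attenuation(r_xy, r_xx, r_yy):
    """Classical correction for attenuation.

    r_xy: observed correlation between tests X and Y.
    r_xx, r_yy: reliability coefficients of X and Y (each in (0, 1]).
    Returns the estimated correlation between error-free scores.
    """
    if not (0.0 < r_xx <= 1.0 and 0.0 < r_yy <= 1.0):
        raise ValueError("reliabilities must lie in the interval (0, 1]")
    return r_xy / math.sqrt(r_xx * r_yy)
```

For instance, `correct_for_attenuation(0.50, 0.80, 0.80)` divides .50
by .80. The less reliable the two measures, the more the corrected
figure exceeds the observed one, which is why a corrected coefficient
cannot be compared directly with the uncorrected correlations quoted
elsewhere in this section.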

Other measures. Storrs (119) reports
interesting relationships between the
W-B scales and the General Aptitude
Test Battery (GATB). The Battery showed
the interest, verbal, and numerical
factors to be related to the V IQ. The
test-factors related to P IQ were
spatial, form perception, clerical
perception, aiming, and motor speed. A
correlation of .58 was found for the
relationship between the total score
for the Battery and the W-B FS. The
correlation with the P section was
slightly higher than with the FS.

Interest in the correspondence between
W-B and intellectual aspects of
Rorschach performance is encountered in
the work of Abrams (2). A
multiple-regression equation is
presented which predicts IQ score. The
Rorschach elements involved are F+%, M,
W, and R. The multiple R is .53.

Only two studies compared W-B scores
with criteria other than tests. Gerboth
(45) found that both forms of the W-B
were significantly related to
grade-point average in school. The
correlation for Form I was .29, while
that for Form II was .26. These r's are
based upon a narrow range of talent.
Although Merrill and Heathers (86) also
sampled intelligence narrowly they
obtained a correlation of .58 between
W-B V IQ and grade-point average.



Summary. As time passes the W-B and its
later forms seem to attain an
increasingly venerable position. It is
now the well-accepted standard of
intelligence evaluation for adults. As
many tests of varied composition are
correlated with the W-B it becomes
clear that the W-B occupies a central
position in evaluating the factor of
general intelligence. When corrections
are made for the ever-present
restricted ranges of talent, the
correlations between the W-B and other
tests argue well for both the
reliability and validity of the test.
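
The correction for restricted range of talent referred to above is
commonly made with the standard univariate formula (often called
Thorndike's Case 2), which rescales an observed correlation by the
ratio of the unrestricted to the restricted standard deviation. The
reviewed authors do not say which formula they applied, so this Python
sketch is an illustration under that assumption. The function and
parameter names are hypothetical, and the formula itself assumes a
linear, homoscedastic relationship between the two tests.

```python
import math


def correct_for_range_restriction(r, sd_unrestricted, sd_restricted):
    """Estimate the correlation in an unrestricted population.

    r: correlation observed in the restricted sample.
    sd_unrestricted: SD of the selection variable in the full population.
    sd_restricted: SD of the same variable in the restricted sample.
    """
    if sd_unrestricted <= 0 or sd_restricted <= 0:
        raise ValueError("standard deviations must be positive")
    k = sd_unrestricted / sd_restricted
    return r * k / math.sqrt(1.0 - r * r + r * r * k * k)
```

When the sample's spread is half that of the population (k = 2), a
modest observed coefficient rises appreciably after correction, while
k = 1 leaves it unchanged. This is why raw coefficients from narrow
samples understate the relationship and why reporting the sample's
range of intelligence matters.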

The principles of test construction
employed with the W-B have been so
successful that an alternate form (132)
and a children's scale (133) have been
developed. These two forms of the test
seem to be comparable to the W-B in
their capacity to evaluate general
intelligence. However, the extension of
W-B-derived subtest pattern
interpretations to these tests has not
been too well supported.

The authors once again bemoan the
failure of investigators to report the
range of intelligence of the samples.
We are especially distressed that so
few of the writers correct their
coefficients for range-of-talent. Too
often the raw correlation is
meaningless as presented in the report.
Too often the tests in the file
determine the study rather than the
nature of the study determining the
data to be collected. The sampling of
one study (31) is exemplary, while
still others show much thought behind
the manipulation of experimental
variables and the statistical analyses
(3, 31, 45, 56).


Short Forms 


Surprisingly, only three papers
(3, 29, 53) propose new short-form
combinations of W-B subtests. First of
these is Gurvitz's (53) thorough
sifting of all possible two-subtest
combinations. With a large sample the
I-DS proved best and was confirmed upon
cross validation. The second of these
proposals is the practical one by
Cotzin and Gallagher (29). They show
that the four-test form C-S-PA-BD
correlates .94 with the FS for a
cross-validation group of defectives.
Both the V IQ and P IQ can be estimated
from this form, which is felt to be of
particular importance when working with
mental defectives. Their regression
equations provide appropriate raw scale
conversions to IQ. Last is the proposal
by Alderdice and Butler (3), who
developed the I-D-S-OA combination so
that S-B scores could be predicted best
for mental defectives.

Previously proposed short forms are
evaluated in an additional three papers
(58, 59, 75). McKenzie (75) studied the
V-I-S-BD form proposed by Kriegman and
Hansen. He concludes this form is a
good means of estimating the
intelligence of mental defectives and
that it provides more diagnostic
information than similar instruments.
In the same year both Herring (58) and
Hilden (59) reported thorough empirical
investigations of previously proposed
forms. Herring studied the short forms
with both mental-hygiene-clinic
patients and normals while Hilden used
only mental-hygiene-clinic patients.
The more extensive study by Herring
(58) indicates the best two, three,
four, and five subtest combinations.

Summary. This review covers only one third as many studies dealing with short forms as appeared in the previous review (97). Instead of a deluge of brief unrelated studies it is gratifying to encounter well-planned comprehensive studies (3, 53, 58, 59). Two of the reports (58, 59) involved considerable effort directed at bringing about an integration of work in the area of short forms. One of the proposed short forms (29) is the first to yield an estimate of both V IQ and P IQ for use with defectives. Sampling problems such as range of talent and cross validation have also received more consideration than formerly. This greatly increased sophistication is most encouraging.


Applications With Special Populations


Sex differences. A comprehensive 
study by Norman (89) explores sex 
differences in young superiors. Many 
significant differences in subtests as 
well as for the V and P IQ's were en- 
countered. In general, men have 
higher scores than women, and the 
greatest difference is apparent on A.
This relatively large sex discrepancy 
in A was encountered in college stu- 
dents by Guertin (51) and proved 
puzzling. Vane and Eisen (125) found 
that both delinquent and normal girls 
had lower subscores on A and D than 
the standardization population of 
Wechsler. Strange and Palmer (120) 
employed psychiatric clinic patients 
for their study of sex differences. 
Their results agree with Norman’s 
positive findings as do the more re- 
cent conclusions of Goolishian and 
Foster (48) using neuropsychiatric 
patients. Brown and Bryan (18) dif- 
fer in that their normals do not show 
the superiority of the males so clearly, 
and patterning is somewhat different 
than found by other authors. 

Miscellaneous populations. Geriat- 
ric changes in W-B performance were 
studied by Birren (13) in an unusual 
way. Instead of merely evaluating 
subtest scores he studied the factor- 
scores of geriatric subjects. His Ver- 
bal Comprehension Factor held up 
well with advancing age, while his 
Closure Factor (performance) showed 
sizable decrements. As a final com- 
ment Birren states, “It is apparent 
that the W-B does not include tests 
of many of the known primary abili- 
ties. For this reason, it is of limited 
use in attempting to describe age 
changes in the intellect.” 

Colored and white neurotics were matched for the usual variables by Davidson et al. (30). The A subtest and performance scores were lower for the colored. Goldstein (47) discusses problems encountered in using the W-B in South Africa. A great many changes had to be made to adapt it to this population. Maleci and Montanari (79) present a similar report on the adaptation of the test for use in Italy. A report from Calcutta University (21) cites the same sort of problems in adapting the test for India. V and PA required the most alteration. Aeppli-Tanner (2) found she could employ a German translation with Swiss adolescents “without extensive alterations.” Gross quantitative results compared well with American norms; however, subtest discrepancies were present.

While not too much has been published about the use of the W-B in vocational counseling, a few studies appeared during this review interval. Of particular importance is Patterson’s (93) guide for counselors. Mean W-B scores are presented for each of 16 broad occupational groups by Simon and Levitt (108). Another by Ladd (70) discloses higher V IQ’s for students in academic teacher-preparation areas as compared with nonacademic teacher-preparation areas. Those in the nonacademic areas showed higher P IQ’s than those in the academic areas. The authors cautiously suggest there might be some implications for counseling. Merrill and Heathers (84) present a normative study setting forth W-B centile scores for a college counseling-center group and also for college student volunteers. They found little difference in the IQ’s for the two samples. They concluded that those with personal difficulties were not likely to be significantly lower on the W-B.

Summary. Sex differences with normals and psychiatric subjects seem to be rather well established. A study employing a representative sample of the general population to identify these sex differences appears much needed at this time. It would be of interest to see whether sex differences appeared in the standardization results of the new Wechsler Adult Intelligence Scale. Perhaps differences in performance by the sexes at one level of intelligence are reversed in direction at a different level so that the total mean differences in test performance are minimal. Differences in A, D, and PC are repeatedly encountered (48, 89, 120). Researchers and clinicians should take heed of this and specify sex as well as age when proposing patterns or applying pattern analyses.

Some of the more familiar populations of past research have received minimal attention recently, while some new interests appear (2, 47, 49, 70, 79, 84). It seems appropriate that as the problems of the past are resolved, new interests should appear. Accompanying this progress is the extension of the W-B to foreign populations.


Refinements and Critiques

Administration and scoring. Kitzinger and Blumberg (66) present a supplementary guide for administering and scoring the W-B. It includes scoring principles and examples as well as good descriptions of conventional subtest rationale. Cohen (23) was interested in scoring and administration biases. He studied samples of W-B’s from 13 examiners and was able to identify evidence of examiner biases that would serve to reduce reliability. Those subtests that show lowest reliability were those that produced the most systematic interexaminer disagreement. This raises the possibility that much of the variability of subtests that is ascribed to unreliability may be systematic variance related to the nature of the examiner.

Item order of the various subtests again receives some consideration by Norman (89). An incidental finding of his larger study was that the order of difficulty for superiors agreed better with the order found by Jastak than with Wechsler’s (131). Mech (83) presents the order of difficulty for high school students. He suggests that the item order be changed and then appends the statistically wise but impractical condition that this “... applies only to students in the 12th grade.” Mech is faced with the enigma of establishing an order of difficulty that will apply for any group of subjects. It would seem necessary to revert to Wechsler’s intent of establishing the item order on a representative population. Different orders of difficulty found with small restricted portions of the whole population would not then be a basis for challenging the originally proposed order. Russell (102) proposes a revised sequence of V words for use with neuropsychiatric patients.

Other subtests have been studied. For example, Luchins and Luchins (74) examined the effect of varying the instructions on the DS subtest. This well-designed study permits the following conclusions: (a) initial test score cannot serve as a reliable index of learning, (b) speed emphasis may retard performance, and (c) Wechsler’s directions are so ambiguous that they produce greater variability among the subjects as compared with the experimental directions. Guertin (51) was interested in a different subtest. He studied the effect of instructions and item order upon A. Contrary to the investigator’s expectancy, the superior subjects of the study were not threatened by encountering difficult items early in the presentation. In fact, they accepted the challenge and performed better than with conventional order of presentation. It seems possible that mentally subnormal subjects might show just the opposite reaction, but this has yet to be verified.

Eglash (38) studied the relationship between the FS score and the shoes item on C. He finds some justification for relating the number of reasons given to intelligence. However, the spread between one and two reasons is such that it would be a more appropriate cutting point than the one in current use. Following up on this suggestion, Armstrong (8) rescored the shoes item, and the r between C and the FS rose from .42 to .48. The OA subtest was the subject of the article by Shannon and Rossi (106). They suggest a presentation method for OA that was completely described several years earlier by Derner and Aborn and was covered in our last review.

Statistical analysis. An item analysis with high school students was made by Mech (83). Item order and discriminative values are presented for each item. Stanley (115) discusses what he feels “every good clinician should know.” The article deals with what to tell the psychiatrist when the V IQ and P IQ are both larger or smaller than the FS IQ. A rather mathematical description of the fallacy of averaging averages points out that averaging the V IQ and P IQ yields a meaningless “average” IQ that could not be expected to correspond to the FS IQ.
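One idealized way to see why the FS IQ can lie outside both the V IQ and the P IQ is sketched below. It assumes that each IQ is a standard score with mean 100 and SD 15, that the full-scale score behaves like a restandardized sum of the two part scores, and that the V-P correlation is known. None of these assumptions is taken from the article under review, and the actual W-B IQ is read from tables of weighted scores, so the function names and the value r = .70 are purely illustrative.

```python
import math


def composite_iq(v_iq, p_iq, r_vp, mean=100.0, sd=15.0):
    """Restandardize the sum of two correlated standard scores.

    The sum of two z-scores with correlation r has SD sqrt(2 + 2r), which is
    less than 2 whenever r < 1, so the composite is more extreme than the
    simple average of its parts.
    """
    if not -1.0 < r_vp <= 1.0:
        raise ValueError("r_vp must lie in (-1, 1]")
    z_v = (v_iq - mean) / sd
    z_p = (p_iq - mean) / sd
    sd_of_sum = math.sqrt(2.0 + 2.0 * r_vp)
    return mean + sd * (z_v + z_p) / sd_of_sum


def naive_average(v_iq, p_iq):
    """The 'average of averages' criticized in the text."""
    return (v_iq + p_iq) / 2.0


if __name__ == "__main__":
    # Hypothetical examinee with V IQ = P IQ = 130 and an assumed r of .70.
    print(naive_average(130, 130))                 # 130.0
    print(round(composite_iq(130, 130, 0.70), 1))  # 132.5
```

Under these assumptions the composite exceeds both part scores above the mean and falls below both of them below the mean; only when the two parts correlate perfectly does it coincide with their simple average.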

Cohen (24, 26) cleverly factor-analyzed the W-B subtests using three types of psychiatric patients. He makes a methodological contribution by demonstrating that the factors underlying success on a test are a function of the sample of people taking the test. The material is presented in detail in a relatively nontechnical fashion in another paper (25), and it is highly recommended to clinicians. Cohen identified the three factors: Verbal, Nonverbal, and Distractibility. Similar factors were derived by Birren (13) for a group of elderly subjects. His factor names were respectively: Verbal Comprehension, Closure, and Rote Memory. In addition a fourth factor is tentatively identified as Induction. Alderdice and Butler (3) factor-analyzed the performance of mental defectives but found only a general and a performance factor. Whiteman and Whiteman (137) report on the application of Jastak’s predetermined clusters: Reality perception, Psychomotor efficiency, Language polarity, and Motivation. Wheeler (136) also reports the results of a factor analysis of the W-B and other instruments. Summarizing, there seems to be a fairly good overlap of factors with a few factors appearing only with certain subjects. Most frequently encountered are a verbal, a performance, and an attention factor.

Miscellaneous. Gurvitz (54) reached his limit of professional tolerance and wrote a critical article on the defects in standardization sampling and procedure for the W-B. The criticisms are too many to list and reading of the original article is essential for anyone planning to employ data from the standardization. He also scolds Wechsler for his carelessness in changing subtest instructions between editions of the manual yet not presenting any restandardization. Gurvitz makes a most reasonable plea for more adequate standardization sampling in future work.
The relationship between learning ability and W-B performance was investigated by McLean (76). By studying neuropsychiatric patients with both the W-B and conventional learning tasks, the author was able to reach several conclusions: (a) higher V IQ is indicative of ability to learn in verbal situations, (b) higher P IQ is indicative of ability to learn in performance situations, and (c) subtest variation is inversely related to learning proficiency. The touch of the learning psychologist is also encountered in the experiment by Burik (19). He found that motor test scores, such as rate-of-tapping, were most closely related to DS. There was also good evidence that the incidental learning of the symbols was not related to the DS score. He concludes that the DS subtest should be regarded more as a test of visuomotor coordination than as a test of new learning.

Hypnosis was used by Kline (67) to demonstrate the W-B in action. He hypnotically regressed and progressed a 22-year-old woman. W-B’s were obtained in these two states as well as at normal age and in a waking state. Under the varying conditions the IQ remained surprisingly constant although weighted scores were necessarily diminished for both the CA’s of eight and 65 years. The changes in subtests with progressed age conformed remarkably well to clinical expectancy. The DQ also is appropriate for that age group. Even item six on BD, which is not in the correct order of difficulty, was failed when the subject “believed” she was 65. It would be expected that the “conforming” subject would select for failing the last and more complex-appearing design which employs all the blocks. Not much is proved, but this study makes interesting reading. A thesis by Ficca (39) relating autonomic features to W-B performance sounded interesting but was not seen.

A series of studies by Stacey and others (113, 114, 115) investigated Gerstein’s hypothesis about conceptual performance. Gerstein designates the descriptive definition as lowest in intellectual requirements, while the functional is intermediate; and the conceptual definition is most associated with high intelligence. Subnormals failed to confirm the predictions of frequency of occurrence for the three types of definitions for V (114). A similar result was obtained when concentrating on S answers (113). In the most recent study, Stacey and Spanier (115) switched to superiors for a study of V responses. The functional was actually associated with lower intelligence than was the descriptive-type definition. This finding was in line with the earlier two studies (113, 114) and suggests that the descriptive-type definition is of a slightly higher level than the functional definition.

Summary. There seems to be a welcome increase in comprehensive, theoretical articles this time (8, 13, 23, 54, 76, 83, 89). The sophistication of researchers seems to be advancing. Some evidence of this is found in the relatively large number of factor-analytically designed studies in this review (3, 13, 24, 26, 136). Also new is the learning theory design focused upon test theory rationale (19, 76). In addition to the more comprehensive studies, there is an ever-growing body of knowledge about the rationale of the individual subtests.


THE WECHSLER-BELLEVUE AS A 
DIAGNOSTIC AID 


The use of the W-B as a diagnostic aid implies an assumption most recently expressed by Jastak: “... intelligence is not a global trait but a general and pervasive part function of the personality.” Persistent criticism of such an assumption is expressed most recently in various research and by Schofield (105).


As a Measure of Emotional Factors 


Brower in 1947 proposed that a 
negative correlation existed between 
the Hy, Hs, and Pd scales of the Min- 
nesota Multiphasic Personality In- 
ventory and the W-B IQ, the prob- 
able rationale being that neurotics 
and persons with character disorders 
would show intellectual impairment. 
Winfield (138) checked this hypothe- 
sis and found no support for it. 

Other interest in the relationship between personality tests and intellectual performance is confined to the Rorschach test. Probably most comprehensive is the study by Holzberg and Belmont (61). They hypothesized a number of relationships between Rorschach signs and W-B test performance, but only four of their many predictions were substantiated. They set forth their empirical findings under eight features of Rorschach performance with counterparts in W-B performance. A single hypothesis was tested by Burnham (20). He failed to find support for the hypothesis that there is a relationship between M and PA. However, the thesis by Spaner (111) touched upon related material. He was able to demonstrate a relationship between M’s on the Rorschach and PA score. Five of the ten hypothesized relationships between Rorschach and W-B performances were confirmed. Another thesis (122), not seen, also deals with the relationship between Rorschach features and W-B performance.

Summary. Attempts to validate the general assumption that something other than a mere IQ may be extracted from performance on the W-B have been disappointing. However, as in the case of the comparison between the Rorschach and the W-B, it might be reiterated that (a) the Rorschach scores, as statistically valid measures of personality, have not been established, and (b) the rationales for the subtests as proposed by Wechsler have been challenged.


As a Measure of Change Following 
Therapy 

Markwell, Wheeler, and Kitzinger 
(81) recorded the W-B performance 
of schizophrenics before and after 
prefrontal lobotomy; and Smykal 
and Wilson (110) did the same with 
electroconvulsive therapy. Mark- 
well, Wheeler, and Kitzinger found no 
statistically significant differences be- 
tween pre- and postoperative subtest 
performance; Smykal and Wilson
found some gross group differentia- 
tion but considerable individual dif- 
ferences to attenuate the positive 
aspect of their findings. 

In the Smykal study, the W-B was administered before treatment and after the fifth and tenth administration of shock (shock being given twice a week for five weeks). Interesting results demonstrated (a) the percentage of IQ’s of 35 or below decreased 50 per cent after the fifth shock; the Markwell group found that lobotomy increased the testability of the patients by 20 per cent, (b) the highest point of efficiency was at the midpoint of treatment, after the fifth shock; and (c) the preshock pattern corresponded to Rapaport’s chronic schizophrenic group, while the post-fifth shock correlated highly with his acute group.

Summary. Research failed to demonstrate differences in subtest performance reliably following either prefrontal lobotomy or ECT. However, the generalization of the two researches, reviewed in this section, must be delimited by the restricted size and nature of their sample populations. What the research does attest to is the fact that, as the result of therapy, the patient does become more amenable to testing.


Mental Deterioration Index

Validity. Tests of the validity of Wechsler’s (131) Mental Deterioration Index (MDI) have produced generally negative results even though Howell (62) found that, “... it may be tentatively concluded that Wechsler’s assumption of deterioration at all levels is correct.” Corsini and Fassett (28) culled the test protocols for 100 inmates of San Quentin prison. In this cross-sectional study, the total population was divided into twelve five-year periods, starting with ages 15-19 and going to 70-74. These investigators found that by and large the verbal test scores increase with age while performance scores decline, suggesting a V-P ratio rather than the “hold,” “don’t hold” ratio, as the more sensitive measure of mental deterioration. These findings are in good agreement with those reported by Birren (13).

Bensberg and Sloan (10) tested the validity of Wechsler’s MDI with a group of mental defectives. They culled their protocols from the files of the Lincoln State School and Colony, excluding those individuals who had not received S-B IQ’s of 42 or better, and those who showed evidence of psychotic or organic processes. The results with the W-B were compared with those on the Arthur Point Scale and the S-B. The results suggested (a) although the Arthur and the S-B demonstrated a negatively accelerating function with regard to CA increments, the W-B scores tended to increase, rather than decrease, and (b) the W-B scores did not differentiate the older group (30-55) from their earlier MDI scores obtained between the ages of 15 and 24. The authors conclude that, “... the ‘normal deterioration’ which Wechsler found for older subjects may have been an artifact due to faulty sampling, at least for the lower intelligence groups.”

The study by Fox and Birren (41), however, suggests that when psychotic or neurological dysfunctioning is excluded, a group of sixty-year-olds are significantly different in subtest patterning from a group of 69-year-olds. Fox and Birren find that only two of Wechsler’s “hold” group “hold,” viz., V and C.

Glik (46) offers a very interesting thesis to explain the inability of measures of deterioration, such as Wechsler’s MDI, to reflect intellectual deterioration adequately. He assumes that the manner of measuring present functioning is in error. Glik reasons that the ability to recognize the meaning of items one once knew is a more sensitive measure of deterioration than measuring what items the subject can actually recall in the present.

Testing this assumption on the V and I subtests, Glik found a significant t between recall and recognition on I items only. Glik generalizes his results to verbal questions without really adequate substantiation of this point, although his proposals surely merit further study. At any rate, Glik’s results do suggest that a discrepancy score between recall and recognition may be more meaningful than the method of recall now used with V and I items.

In senile psychosis. The major question to be asked here is: does the addition of a psychotic process superimposed upon the “normal” senescent decline significantly alter the W-B performance? Berkowitz (11), Botwinick and Birren (15), and Doerken and Greenbloom (36) tested the efficiency of the W-B to differentiate “normal” and “abnormal” seniles and found positive results, while Botwinick and Birren (16) found that the MDI does not differentiate a control group of seniles from an experimental group of seniles with psychosis. The investigators of the latter study as well as Birren (13) emphasize that mental illness in these aged caused a greater deficit in P IQ than V IQ although there was good evidence of inroads on Verbal performance also.

In organic brain disease. Anderson (6) and Ptacek and Young (94) conducted research in attempts to determine how effective the MDI is in identifying mental deterioration in organics. The results point to a rejection of the sensitivity of the MDI.

Anderson (6) addressed himself to the effects of laterality of localization of brain damage and its effect upon the MDI, with the results being negative. Although the subjects were identified neurologically as organic, they were not cross identified by the MDI.

Wheeler and Wilkins (135) tested the Hewson ratio, an empirical formula devised to differentiate organics from nonorganics, but found it to be lacking in ability to make individual predictions.

In neurosis. O’Connor (90) tested the effect of neurosis upon the MDI. The results indicate that the “hold” versus “don’t hold” ratio is inapplicable to neurosis.

Summary. Research attempting empirically to define the utility of the MDI has eventuated in negative results. The MDI has proven to be insensitive to deterioration in senility, with or without psychosis, organic brain disease, and neurosis. The research findings did suggest that the present “hold”-“don’t hold” ratio was not sensitive to changes due to a deteriorative process, but that the verbal-performance dichotomy was.

Regardless of the nature of the
findings per se, as we look back over the various investigations conducted in this particular area with the W-B, one is struck by the number of important methodological criticisms that may be leveled against the experimentation. In the first place, two validity studies use highly restricted sample populations. Both these investigations rejected the utility of the MDI with their populations, but it is hard to generalize as to the validity of the MDI with other populations. It is most difficult to propose seriously an index of deterioration that would cover the full range of intelligence and age. Probably Wechsler was too ambitious. It is still more difficult to make cross comparisons of studies with such wide ranges of variables which are known to relate to subtest patterning, viz., IQ, CA, education, and vocational skills. Such variables only serve to confound the data and confuse the research worker. Even more difficult to interpret are results from a single sample composed of both psychotics and nonpsychotics, organicity due to trauma, and organicity due to endogenous causes, and even varying degrees of organicity and central involvement (94). Anderson in another study (7), attempting to define the effect of the laterality of localization of brain damage, has this to say about his own sample, “... non-dominant hemisphere sub-group ... was significantly older than the dominant hemisphere group,” and “... although brain damage was established beyond any reasonable doubt ... the criterion of unilaterality is probably relatively poor.”

It appears that the conclusions drawn from the results of the investigations attempting to define the effectiveness of the MDI must be tentative because of restricted or questionable sampling. These have apparently introduced distortion into the subtest performance, confounding the results. Yet there is uniform agreement that the MDI was not effective with those samples employed.

Scatter and Pattern Analysis


Considerable research has been conducted on patterns of performance on the subtests, and on ... Since the last review, emphasis has been placed on empirically ... objectively derived analy... patterns, and considerable clinical research acumen has been ... into this intriguing q...

General find... that pattern ana... rather ... a g-score ... weighted score ... inherent ... and Benso... with the ... Witten... found tha... inversely ... and variat... Drevdahl, ... a Pearson ... at .01 level ... anxiety (Hypoch...isis, Depression, and Ps... Scales on the MMPI ... the subtests, and Moldaw... apt indicator ... measures of ... found ... However, ... or Matarazzo (82) ... relationship between anxiety and performance on the subtests. Kaldegg (65), Love (73), and Wittenborn and Holzberg (139) found no significant
relationship between variability of subtest performance and pathology; and Rakusin (98) found “... a lack of uniformity in normal scatter patterns on vocabulary scatter.” Rakusin, incidentally, found a significant difference on the patterns of subtests, ... a clinic (maladjusted) and nonclinic group, but also discovered that these apparent differences disappeared when the effects of total scatter were alle... method of multipl... Rakusin concludes (98): “... obtained could not ... attributed to maladjustment.” Merrill and Heathers (85) conclude, on the basis of their research on college students, that “scatter is to be ... groups of above average ...” Jastak found ... schizophrenics, and ... differentiated by ..., and Wittenborn ... (139), in a chi-square ... performance of para... manic-depres... hotics, and psy... their ... performances ... ilities, four ... group ... ible.

...rg, Alessi, and Talkoff (60) ... ability of seven judges to ... intelligence of te... at Connecticut ... In general, the ... their evaluations on the amount of intertest sc... the individual protocols. Intercorrelations between the judges’ estimates ranged from .15 to .87. However, such a matrix of correlations tells little about the over-all intercorrelation of the judges, i.e., how well they were doing as a group. Hence, one of the present writers (GF) simply reproduced the complete correlation matrix from that presented in the article, converted the rho’s into ranks, and computed a W, a coefficient of concordance, a nonparametric multiple-rank correlation technique, which yields an over-all coefficient of correlation. W was .27 (not significant), hence indicating that there was poor agreement between the judges. The poor results of the Connecticut research may be a function of the inexperience of the judges since four of the seven were interns, the other three being staff members.

Methodology. Before we discuss the research on pattern and scatter analysis with regard to the various disease entities, the writers deem it necessary to preface the discussion. Antecedent to an understanding of the effect of psychopathology upon W-B performance seems to be the need to recognize that factors other than psychopathology affect subtest performance, and must be taken into account in such analysis. For instance, Cohen (23) found an examiner bias; Collins (27) found variation in subtest as a function of age; many (18, 48, 51, 89, 109, 120, 125) found significant deviations on subtest scores due mainly to sex; Aronov (9) and O’Connor (90) testify to the effect of education on scatter; and Aronov (9), French and Hunt (44), Merrill and Heathers (85), Norman (89), O’Connor (90), Ortar (92), and Schnadt (104) validate the effect of IQ level per se on scatter.

The reader will understand why knowledge of these findings is propaedeutic to an adequate purview of the research on pattern analysis with the W-B. In attempts to control for the effects of these variables, the research worker has relied upon the statistical concept of randomization. Statistically the assumption is maintained that if you randomize the effects of these uncontrollable, hence, unmeasurable parameters, equally throughout the cells of one’s table, the effects of the randomization will be to equalize the effect of these parameters upon the variance. Such statistical reasoning has been translated into research methodology in the form of the matched group design. However, certain research suggests that the simple randomized design is not the answer to a research worker’s prayers.

Frank, Corrie, and Fogel (43) and
Reich (99) have demonstrated that
we are no longer speaking meaning-
fully about “neurotics” or “schizo-
phrenics” when we include wide
ranges of attributes (age, education,
IQ) in one group. For instance, Reich
(99) demonstrated that when he com-
pared the subtest performance of his
group of schizophrenics from Kings
County Hospital with data from
other groups of similarly diagnosed
subjects presented in the previously
published literature, the rank-order
correlations ranged from .84 to .19.
Frank, Corrie, and Fogel (43) per-
formed an analysis of variance on the
subtest performance of like-diagnosed
cases presented in the literature,
which included wide ranges of IQ,
education, age, etc., and found them
to be statistically different. If one
begins by comparing incomparables,
the results are bound to be spurious.

Subtest distortion is not just a
function of research methodology,
but may accrue as a result of factors
inherent in the W-B itself. For in-
stance, Cohen (26) finds that the W-B
subtests do not always appear to
measure the same factors in different
types of patients. Marks (80) criti-
cized Wechsler's qualitative pattern
approach to scatter analysis, since it
permits a wide range of subtest
variation coding within a given diag-
nostic category, such as from ++ to
−. This tends to “load the data” in
terms of a lack of reliability and
validity of Wechsler's diagnostic pat-
tern. It appears “safe” to review re-
search on pattern analysis in the dif-
ferent diagnostic categories now that
the “warning alarm” has been
sounded.

Schizophrenic patterning. Harper
(55) applied Fisher's discriminant
function to pattern analysis. In gen-
eral, Harper felt that the pattern of
subtests differentiated his group of
schizophrenics from his normals, and
further, that the subtypes, paranoid,
hebephrenic, and catatonic, were sig-
nificantly differentiated from the rest
of the schizophrenics and the nor-
mals. Harper wrote, however, of the
regression equation he [...]: “The
extent t[...] weights would be
ap[...] if [...] sample from a differ-
ent [...] is not known.” [...] was
needed, so on the suggestion of Solo-
mon Machover, Reich (99) performed
this test. He found that the Harper
equation identified 65 per cent of a
new schizophrenic population at Kings
County Hospital as schizophrenic,
concluding that individual differentia-
tion was poor. On a further sugges-
tion by Machover, Frank (42) tested
the effectiveness of Harper's equation
when [...] a heterogeneous group of
psychotics (excluding schizophrenics)
at Kings County Hospital. He found
that the Harper equation misidentified
47 per cent of this population as
schizophrenic, thereby questioning the
utility of the formula. Rogers (100)
compared schizophrenics and neu-
rotics on fifteen signs postulated by
Rabin, Rapaport, and Wechsler, and
found only eight of them tenable.
However, he grouped the various
subtypes of schizophrenia under a
single heading. On the other hand,
on the basis of a comparison of 50
neurotics and 50 schizophrenics, little
agreement was found between
Wechsler pattern and psychiatric
diagnosis. Whiteman and
Whiteman (137) had some success
using factor analysis on the perform-
ance of 50 schizophrenics and 50
police applicants, finding that clusters
entitled “reality perception” and
“psychomotor efficiency” significantly
differentiated the groups. McNeal
(77) checked Wechsler's signs for
schizophrenia on equated normal and
schizophrenic groups for 340 male
veterans of World War II. “Only one
of Wechsler's signs discriminates sig-
nificantly between the [...] groups
but it identifies schizophrenics
[...]rmal than normals.” What
appears to be a very interesting and
thought-provoking finding is Mon-
roe's (88) demonstration that schizo-
phrenics with average or above aver-
age intellectual [...] had no [...]
and that ex[...] characteristic of
[...] with low [...]


Patterning in the affective disorders.
[...] could find but one investi-
gation in this area during the entire
[...] year period, an apparent [...]
the need for more research. [...]
and Guy (126) found differences
between manic-depressives and
[...]ls, but none between depressive
and manic states of the manic-
depressive psychosis. However, fur-
ther analysis revealed that the pa-
rameter of age was contributing an
unknown amount of variance to the
pattern, hence depreciating the mean-
ingfulness of the results.

Patterning in neurosis. Monroe
[...] factorial design to test the
effect of levels of adjustment,
in[...], and locality, found little
differentiation between neurotics and
well-adjusted normals when the in-
teraction of these effects was deleted
from the between-group variance.
Schillo (103) found no difference be-
tween neurotics and normals on dis-
parity between verbal and perform-
ance IQ, performance on individual
tests, or rank-order correlation of test
performance, although his neurotics
showed greater variability from their
own group mean than the normals.
Schillo's subjects, though, included
individuals subject to anxiety reac-
tion, obsessive-compulsive reaction,
and mixed neurosis; and no data are
offered as to the possible differences
within this group as related to the
subtypes.

Patterning in mental deficiency.
Alderdice and Butler (3) seem to be
the only investigators interested in
identifying a mental defective pat-
tern. They present thoughtful ap-
proaches to pattern analysis and com-
pare their findings with those of pre-
vious workers. Their pattern for
mental deficiency, while not a sharply
delineated one, is similar to those pre-
viously reported. It is most at vari-
ance with the pattern initially pro-
posed by Wechsler (131). Little work
seems to have been done in terms of
an understanding of the performance
of the mentally defective subject on
the subtests of the W-B. One investi-
gation, by McPherson and Fisch (78),
attempts to understand the defective's
poor performance on S. Using ap-
proximately 30 subjects with an IQ
range of from 55 to 85, they found
that 66 per cent of the failures were
due to evasiveness, what McPherson
and Fisch term “learned negativism.”
The authors suggest that as a result
of such behavior, testing may not
yield an exact measure of the defec-
tive's ability.

Patterning in organicity. Collins
(27), investigating the effect of epilep-
tic involvement, found that only cases
diagnosed with organicity demon-
strated deterioration. Further analy-
sis revealed the superiority of the
performance of the endogenous to the
exogenous group. A comparison of
Collins' 400 outpatient epileptics
with the protocols of institutionalized
groups of epileptics, psychotics, neu-
rotics, and psychopaths, presented in
previous research, showed consider-
able differences. In the analysis of 26
cerebral arteriosclerotic patients com-
pared with 26 patients with “other
forms of cerebral pathology” matched
for age, education, and IQ, Oppen-
heim (91) found no significant differ-
ences on subtests or in total varia-
bility of performance.

In one investigation Diers and
Brown (35) found that there was an
inverse relationship between W-B IQ
and the Hughes factor-analytically
derived signs on the Rorschach for
intracranial pathology. In another
investigation Diers and Brown (34)
analyzed the protocols of 24 patients
diagnosed as having multiple sclero-
sis. When compared with normal
controls, the sclerotic group demon-
strated a lower memory span for D
and superior PC; however, a Pearson
r of .17 (not significant) was obtained
between Wechsler's organic signs and
actual organicity. The authors offer
the following alternatives to explain
their results: “Quantitative signs of
organic damage to the brain, or the
index of deterioration on the Wechs-
ler-Bellevue scale, are inadequate as
an indicator of existing cortical dam-
age in multiple sclerosis,” or “No
cortical pathologic changes existed in
the population with multiple sclerosis
comprising this study.”

Patterning of the sociopath. The
patterning of the sociopath (psycho-
path, delinquent) has come under
considerable scrutiny since the last
review. Bernstein and Corsini (12)
tested the validity of Wechsler's as-
sumption that performance scores
are higher than verbal scores, which
they could not reject. By the method
of forward and backward presenta-
tion of the subtests, they rejected
the assumption that this was a spuri-
ous difference due only to the fact
that the performance part of the test
is administered late in the examina-
tion period, permitting the sociopath
to adjust more adequately.

Graham (49) found that there was
a significant relationship between
performance on the W-B and school
performance, and that the scatter-
gram of Wechsler's adolescent psy-
chopath closely approximates that
of the unsuccessful reader. Graham
makes the appealing suggestion, “It
does not seem unreasonable to as-
sume that the profile [...] Wechsler's
adolescent psychopath is typical of
[...] educationally retarded youth
[...] his moral qualities.”
[...]eisen (125) tested the
difference on the W-B between
groups [...] age, intelligence, [...]
socioeconomic background [...]
matched delinquent and nondelin-
quent groups. Higher scores on D
and [...] delinquent than [...] was
about all they [...] differentiate the
two groups. [...] compared the
subtest performance of a matched
group of [...] inmates diagnosed as
psychopaths, [...] “[...]hopathic”
inmates, and [...] standardization
group of [...]. Through a chi-square
[...] t-test of significance, the results
indicated that there was no charac-
teristic subtest pattern for the psy-
chopath, and, further, that there were
no differences between the “psycho-
pathic” and the “normal” prison in-
mate, and between these two and
Wechsler's psychopathic population.

In a similar study, Clark and
Moore (22) had previously attempted
to differentiate the test patterns of
subclassifications of military offenders
(no NP disorder, immaturity reac-
tion, and pathological personality
types, presumably psychopaths) but
with no success.

If one considers unauthorized dis-
charge from a medical hospital as
asocial or antisocial behavior, Thur-
ston and Claden's (123) study on the
irregular discharge of tuberculosis
patients becomes pertinent here.
However, the W-B could not differ-
entiate the “AWOL” group from
those who remained in the hospital,
either in terms of IQ or subtest pat-
terning.

Summary. Our somewhat jaun-
diced eye continues to reject the
assumption of unique subtest per-
formances by schizophrenics. The
results are at best inconsistent, but
their very inconsistency may testify
to the erroneous assumption that
randomization will control for the
effect of the parameters upon subtest
performance. But the reader of such
research studies is left in the predica-
ment of being uncertain whether to
attribute the negative results to the
methodology or to the hypothesis.

For groups of mental defectives
patterning seems better established
than for schizophrenics. The search
for patterns of performance charac-
teristic of the organic brain-damaged,
manic depressives, and neurotics
has been rather fruitless but, as
with schizophrenics, the sampling
methods have left much to be desired.

Researches with the sociopath indi-
cate some points of agreement but
there is still much disagreement. The
pattern proposed by Wechsler, “ado-
lescent psychopath,” is coming to be
evaluated in terms of the subjects'
training, academic motivation, and
general background of experience.
The over-all impression gained
from the pattern studies reviewed is
that the findings are inconclusive.
Patterns suitable for clinical use will
not be forthcoming until methodo-
logical improvements appear. Fur-
ther advances in typology can do
much to establish suitable diagnostic
criteria for which patterns can legiti-
mately be sought.


GENERAL SUMMARY 
The past five years exhibit two
major changes in the research trends
with the W-B. In the first place,
there has been a realignment of the
general nature of the studies. Re-
search with psychiatric syndromes
has been reduced, whereas a larger
proportion of the studies reviewed
deal with the W-B as a test of general
intelligence, investigating its relia-
bility, validity, rationale, etc. Sec-
ondly, it has been noted throughout
the review that the number of well-
controlled and statistically sophisti-
cated studies has markedly increased.

One rather new and somewhat un-
expected trend is the demonstration
of sex differences. It points up an
additional uncontrolled factor in
previous studies and has important
implications for the use of test pat-
terns, scatter, and such. Of course,
the need for a larger, more repre-
sentative sample for the demonstra-
tion of sex differences, or their ab-
sence, at different levels of intelli-
gence still remains. It may be added
that another factor that needs con-
trol in research is that of socioeco-
nomic level. Social class has been
found to have a bearing on intelli-
gence test achievement. This remains
comparatively unexplored with the
W-B. Moreover, it may have an im-
portant relationship to the Verbal-
Performance discrepancies noted in
psychopaths, delinquents, and others
whose status in such classifications
may be related to socioeconomic level
as well.

When one looks at the work done
with various psychiatric populations
by means of scatter and patterning
methods, one can readily conclude
that “nothing new has been added”
either in methodology or in definite
findings. The results are still “incon-
clusive.” It may be wondered
whether it should not be said that
there are no positive results instead
of leaving an open crack in the door,
implicitly indicating a still-tenacious
clinging to an overworked hypothesis.

Maybe, with the creation of a
newly standardized instrument, simi-
lar in structure to the W-B, but not
suffering from the numerous weak-
nesses which have been pointed up
in this review and in previous ones,
more fruitful research with pattern
analysis will be forthcoming. How-
ever, such research cannot continue
to close the eyes to the weakness of
the criterion itself: psychiatric diag-
nostic classification.


THE INTERMEDIARY KEY IN THE ANALYSIS OF
INTERPERSONAL PERCEPTION¹


What we are here calling inter-
personal perception has also been
called empathy, sensitivity, under-
standing, diagnosis, social percep-
tion, etc. In recent years, the follow-
ing procedure has been much used
in the investigation of interpersonal
perception: A person, or a group,
called the Other, provides a self-
description, as by filling out a per-
sonality inventory or rating scale.
Another person, called the Judge,
predicts the Other's self-description,
filling out the same inventory or rat-
ing scale with the responses he pre-
dicts the Other would give. The
Judge's accuracy score is then the
closeness of his predictions to the
actual responses of the Other.
To many psychologists (e.g., 8, 14)
this operation has qualified almost by
definition as a measure of the Judge's
ability to empathize with the Other.
It does seem, at first glance, that the
Judge should have to “feel himself
into” the Other's personality, feel-
ings, attitudes, self concept, and the
like, if he is to predict accurately the
Other's self-descriptions. This ap-
parently straightforward technique
does not, however, yield simple,
wholesome data. Rather, beneath
its surface, we find an intricate com-
plex of processes and components.


¹ A draft of this paper was presented at a
symposium on interpersonal perception during
the APA meetings, September, 1954. This in-
vestigation was supported by research
grant (M-650) from the Institute of Mental
Health, National Institutes of Health, Public
Health Service.

Our purposes in this paper are (a) to
identify, classify, and illustrate inter-
mediary keys, the common meth-
odological element in many devices
that have been used to reveal this
complexity; (b) to describe various
considerations in the use of this tech-
nique; and (c) to formulate the rela-
tionship between our methodological
model and the analysis-of-variance
approach developed by others.


TYPES OF INTERMEDIARY KEY

Our basic idea is that many of the
approaches developed independently
by a host of investigators may [...]
developing a protocol, i.e., a
set of answers to the items, against
which are compared the usual two
protocols in this kind of interper-
sonal perception research: the Other's
self-description and the Judge's pre-
dictions. We can then obtain meas-
ures of similarity of each of these pro-
tocols to the intermediary key or pro-
tocol. The intermediary key provides
an organizing principle for much of
the recent research in interpersonal
perception. Our examples, each of
which is described below, fall into
three categories: (a) a priori keys,
made out by the psychologist on the
basis of some theoretically significant
psychological variable; (b) keys ob-
tained by varying instructions to
Judges; (c) keys based on central
tendencies of predictions or self-
descriptions.

A Priori Keys

1. An intermediary key consisting
of the affirmative responses to all of
the items is used in work (4) on re-
sponse sets, such as acquiescence, in
test-taking. Scoring the Judge's pre-
dictions and the Other's self-descrip-
tions with such an acquiescence key
shows that the accuracy score can be
influenced by a coincidence between
[...] tendencies [...] acquiescence
[...].

2. [...] inventory [...] predictions
are scored with [...] of the favorable
choi[...] favorability-of-pred[...].
These [...] the accuracy [...] will
correlate positively [...] scores [...]
Other described him[...]. Hence,
favorability [...] to expose an [...]
concealed aspect of interpersonal per-
ception. Then accuracy may result
[...] perceiving cues [...], from
having intuit[...] Einfühlung.
Rather, it [...] from a fortuitous con-
comitance between the Judge's
favorable impression concerning an
Other or toward a group of which the
Other is a member, and the Other's
tendency to describe himself favor-
ably.

3. An adjustment key was used
(19) when the predictions and self-
descriptions were obtained on the
Bell Adjustment Inventory. This
made possible measures of the Judges'
attribution of adjustment and the
Others' self-described adjustment.
Such measures should have much in
common with those yielded by the
favorability key described above.
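
The scoring operation common to these a priori keys can be sketched briefly. The sketch assumes two-choice items coded 1 (affirmative) and 0 (negative) and takes a score to be the proportion of items on which a protocol agrees with a key. The function name `match_score` and the six-item protocols are hypothetical illustrations, not data from the studies cited.

```python
def match_score(protocol, key):
    """Proportion of items on which a protocol agrees with a key."""
    if len(protocol) != len(key) or not key:
        raise ValueError("protocol and key must cover the same non-empty item set")
    agreements = sum(1 for p, k in zip(protocol, key) if p == k)
    return agreements / len(key)


# Hypothetical six-item inventory; 1 = affirmative, 0 = negative.
acquiescence_key = [1, 1, 1, 1, 1, 1]        # a priori key: every item affirmed
judge_prediction = [1, 1, 0, 1, 1, 1]        # Judge's prediction for the Other
other_self_description = [1, 1, 1, 1, 0, 1]  # the Other's own answers

# The usual accuracy score, and the two scores yielded by the key.
accuracy = match_score(judge_prediction, other_self_description)
acquiescence_of_prediction = match_score(judge_prediction, acquiescence_key)
acquiescence_of_self_description = match_score(other_self_description, acquiescence_key)
```

In this made-up case both protocols agree with the acquiescence key on five of six items, and the two protocols agree with each other on four of six, so a good part of the apparent accuracy could reflect nothing more than a shared tendency to answer affirmatively.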


Keys Obtained by Varying Instructions
to Judges

4. Suppose we obtain the Judge's
self-description on the same items
as those on which he predicted the
Other's response (2, 9, 10, 17, 21).
Comparing this protocol with the
Other's self-description yields a score
that has been called “real similarity”;
comparing the Judge's self-descrip-
tion with his predictions of the Other
yields a score for the Judge's assumed
similarity between the Other and
himself. This procedure can be used
[...] down [...] accuracy score [...]
components, “warranted assumed
similarity” and “warranted assumed
dissimilarity.” Judges are [...] some
cases because they [...] similar to the
Other and [...] high similarity; in
other instances, Judges are accurate
because [...] dissimilar and assume
little similarity.
5. When the Judge predicts the
responses of the typical member of
the subcultural group to which the
Other belongs, we can obtain a
stereotype key (10). Applied to pre-
dictions, such a key yields a “rigid-
ity” score for the Judge, reflecting
the degree to which he tends to see
the Others as typical, and a simi-
larity-to-stereotype score for the
Other. Accuracy occurs if the Judge
follows his stereotype when the Other
is actually similar to it. For a group
of Others who strongly resemble a
Judge's stereotype, we get high ac-
curacy when the Judge is “rigid” in
the sense of thinking that the Others
are all like his stereotype.

6. Manifest stimulus value, the im-
pression that the Judge forms as to
the actual personality of the Other,
can be distinguished from the Judge's
prediction of how the Other will de-
scribe himself. For example, an ef-
feminate boy may have such a mani-
fest stimulus value that we would de-
scribe him as really preferring to play
girls' games. But we would predict
that he would choose the boys' games
on an interest inventory because he
could not be expected to admit to
interests so clearly out of line with his
sex role. Suppose we ask a group of
Judges to describe a boy as they
think he really is and not necessarily
as they predict he would respond to
a personality inventory. The result-
ing manifest stimulus value key
would be made up of the modal
Judge's descriptions of the Other. Ap-
plied to the Other's self-descriptions
it yields an “insight-and-frankness”
score (22). Applied to the predictions
of Judges, it yields an “attribu-
tion-of-insight-and-frankness” score.
Now, assuming correct perception of
the manifest stimulus value, accuracy
can be fractionated into the Judge's
warranted attribution-of-insight-and-
frankness and warranted attribution-
of-self-deception-and-lack-of-frank-
ness.
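
The breakdown of accuracy described under key 4 can be illustrated with a minimal sketch. It assumes two-choice items coded 0/1, so that every accurately predicted item is either one on which the Judge assumed similarity and was in fact similar, or one on which he assumed dissimilarity and was in fact dissimilar. The function name `similarity_breakdown`, the labels in the returned dictionary, and the eight-item protocols are hypothetical.

```python
def similarity_breakdown(judge_self, prediction, other_self):
    """Split accuracy on two-choice items into warranted components."""
    n = len(judge_self)
    if n == 0 or len(prediction) != n or len(other_self) != n:
        raise ValueError("all three protocols must cover the same non-empty item set")
    real = assumed = accurate = warranted_sim = warranted_dissim = 0
    for j, p, o in zip(judge_self, prediction, other_self):
        real += (j == o)       # Judge and Other actually answer alike
        assumed += (j == p)    # Judge predicts the Other answers as he does
        if p == o:             # prediction is accurate on this item
            accurate += 1
            if p == j:
                warranted_sim += 1     # assumed similarity, and was similar
            else:
                warranted_dissim += 1  # assumed dissimilarity, and was dissimilar
    return {
        "real_similarity": real / n,
        "assumed_similarity": assumed / n,
        "accuracy": accurate / n,
        "warranted_assumed_similarity": warranted_sim / n,
        "warranted_assumed_dissimilarity": warranted_dissim / n,
    }


# Hypothetical eight-item protocols.
scores = similarity_breakdown(
    judge_self=[1, 0, 1, 1, 0, 0, 1, 0],
    prediction=[1, 0, 1, 0, 0, 1, 1, 0],
    other_self=[1, 0, 0, 0, 0, 1, 1, 1],
)
```

With these made-up protocols the two warranted components sum to the accuracy score, which is the sense in which the Judge's own self-description serves as an intermediary key.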


Keys Based on Central Tendencies of 
Predictions or Self-Descriptions 


7. Modal prediction keys can take
two forms: (a) for each item, the aver-
age prediction obtained from a group
of Judges in predicting the response of
a single Other, and (b) for each item,
the average prediction of a single
Judge for several Others. The first
of these can break down the accuracy
score into typicality-of-prediction
and predictability-of-the-Other's-self-
description. When Judges make
highly typical predictions for highly
predictable Others, they are accu-
rate; when they make atypical predic-
tions for Others who are missed by
the majority of the Judges, they will
also be accurate. The second kind of
modal prediction key can be used to
obtain a measure of the Judge's “im-
plicit stereotype.” It is involved in
measures of stereotype accuracy (5).

8. A key embodying the modal
self-description of the Others (15), i.e.,
the manner in which the majority of
Others describe themselves, yields a
score for the similarity of the Judge's
predictions to the modal self-descrip-
tion and a score for the Other's typi-
cality. Accuracy then becomes a
function of these two scores. Talland
(23) also averaged self-descriptions
but used them only as the key for
scoring the individual member's ac-
curacy in evaluating group opinion.
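
A key based on central tendency can be built directly from the protocols themselves. The sketch below illustrates form (a) of the modal prediction key: the most frequent prediction per item across several Judges predicting for one Other. It assumes two-choice items and an odd number of Judges, so that no ties arise. The names `modal_key` and `match_score` and the five-item protocols are hypothetical.

```python
from collections import Counter


def modal_key(protocols):
    """Most frequent response to each item across a set of protocols."""
    return [Counter(responses).most_common(1)[0][0] for responses in zip(*protocols)]


def match_score(protocol, key):
    """Proportion of items on which a protocol agrees with a key."""
    return sum(1 for p, k in zip(protocol, key) if p == k) / len(key)


# Hypothetical predictions by three Judges for the same Other (five items).
predictions = [
    [1, 0, 1, 1, 0],
    [1, 1, 1, 0, 0],
    [1, 0, 0, 1, 0],
]
other_self_description = [1, 0, 1, 0, 0]

key = modal_key(predictions)

# Typicality-of-prediction for each Judge, and predictability of the Other.
typicality = [match_score(p, key) for p in predictions]
predictability = match_score(other_self_description, key)
accuracy = [match_score(p, other_self_description) for p in predictions]
```

The same `modal_key` function applied to the self-descriptions of several Others would give a key of the kind described under 8.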


EVALUATING AND USING INTER-
MEDIARY KEYS

Bases for Evaluating Intermediary
Keys

How can we choose from the many
possibilities those intermediary keys
that have genuine value in the analy-
sis of interpersonal perception? Three
bases for evaluating intermediary
keys are as follows:

1. The internal consistency over
items of the score obtained by apply-
ing the intermediary key to the
Judges' predictions. Unless this in-
ternal consistency is substantial, the
intermediary score cannot be con-
sidered psychologically characteristic
of the Judge.

2. Degree of confounding of two or
more possible intermediary keys. For
example, the acquiescent choices on
an inventory (e.g., the California F
scale, the Minnesota Teacher Atti-
tude Inventory) may also tend to be
the unfavorable choices. If so, we
can determine which of the two is
operating only by revising our inven-
tory. When the confounding is elimi-
nated, scores obtained with one of the
keys may lose their internal con-
sistency.

3. Degree to which the inter-
mediary key yields scores logically
attributable to genuine social interac-
tion between the Judge and the
Other. In some situations, a given
intermediary score may reflect a char-
acteristic of the Judge which existed
prior to his observation of the Other.
In other situations, the same inter-
mediary score may reflect genuine
influences of observation or interac-
tion, e.g., a favorable reaction to a
particular Other. We should dis-
tinguish between such post- and
preinteraction intermediary scores.
Evidence for the preinteraction char-
acter of an intermediary score is ob-
tained from the existence of gen-
erality of the score over Others, espe-
cially if the Others are heterogeneous
with respect to the variable(s). When
the intermediary scores obtained on
Judges' predictions for heterogeneous
Others correlate highly among them-
selves, the predictions are probably
autistically determined rather than
determined by evidence concerning
the Others.

Using Intermediary Keys 

These considerations can be used
to examine possible intermediary
keys both empirically and logically.
Those which prove upon examination
to be relevant to the particular prob-
lem at hand should be used in the an-
alysis of interpersonal perception.
Thus, use of intermediary keys may
reveal that the measures obtained in
a given situation are highly loaded
with conceptually irrelevant vari-
ance.

If so, such irrelevant influences on
interpersonal perception measures
can often be reduced by lowering the
internal consistency of the inter-
mediary score obtained on the
Judge's predictions. For example,
the information available to the
Judge can be made more relevant to
the predictions requested. Then the
Judge may have less tendency to fall
back upon autistic response sets. Al-
though assumed similarity scores, for
example, would still be obtainable,
they would no longer be general over
items or Others. An item format in
which the response alternatives
change from one item to the next will
reduce the likelihood of a positional
response set (4) in the Judges' pre-
dictions, e.g., reliable individual dif-
ferences in tendency to choose the
first of two response alternatives.
Similarly, a forced-choice format, in
which the favorability of various
choices has been equated within each
item, will reduce the reliable favora-
bility-of-prediction variance in the
Judges' predictions.

Another method of minimizing the
influence of general response disposi-
tions is to give credit for accuracy
only when the Judge correctly dif-
ferentiates in his predictions for two
Others (or between an Other and
himself) who answered an item dif-
ferently. A “refined empathy” score
derived by subtracting the assumed
similarity score from the accuracy
score has been suggested (12). Partial
correlation has also been considered,
i.e., basing the accuracy score on the
partial correlation of the Judge's
prediction with the Other's self-de-
scription, holding real similarity con-
stant. Both of these methods have
proved on further examination to be
inadequate (3, 11). We have scored
Judges' predictions on only those
items on which two Others responded
differently, giving credit for accuracy
only when the Judge correctly pre-
dicted the difference. Corrected split-
half reliabilities for this score for two
groups of Judges each predicting for
a different pair of Others were .65
and .51.
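
The differential scoring rule just described can be sketched as follows. Two assumptions are made that the text does not spell out: first, that on two-choice items "correctly predicting the difference" means that both predictions are right on an item where the two Others disagree; second, that the split-half correction was of the Spearman-Brown type. The function names and the six-item protocols are hypothetical.

```python
def differential_accuracy(pred_a, pred_b, self_a, self_b):
    """Score only items on which Others A and B answered differently.

    Credit is given when the Judge's predictions reproduce that
    difference, i.e., both predictions are correct on the item.
    Returns None if the two Others never differ.
    """
    scored = credited = 0
    for pa, pb, sa, sb in zip(pred_a, pred_b, self_a, self_b):
        if sa != sb:
            scored += 1
            if pa == sa and pb == sb:
                credited += 1
    return credited / scored if scored else None


def spearman_brown(r_half):
    """Step a half-test correlation up to full-test length (assumed correction)."""
    return 2 * r_half / (1 + r_half)


# Hypothetical six-item protocols for two Others and one Judge.
self_a = [1, 0, 1, 1, 0, 1]
self_b = [1, 1, 0, 1, 1, 0]
pred_a = [1, 0, 1, 0, 1, 1]
pred_b = [0, 1, 1, 1, 1, 0]

score = differential_accuracy(pred_a, pred_b, self_a, self_b)
corrected = spearman_brown(0.5)
```

Because items on which the two Others agree are never scored, a Judge who gives the same prediction for both Others on every item can earn no credit, which is how the rule removes the contribution of general response dispositions.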


How Intermediary Scores Account for 
Nongenerality of Accuracy 


Intermediary keys have shed light 
on the failure to find generality over 
Others in measures of accuracy of 
interpersonal perception. Typically, 
accuracy in judging or predicting for 
one Other has correlated less than .35 
with accuracy in judging or predict- 
ing for a different Other (e.g., 1, 2, 
3, 7, 10, 13, 16). At the same time, 
generality over items in accuracy of 
predicting for a single Other has often 
been substantial, i.e., .7 and higher. 

How can we account for such re- 
sults? By use of intermediary keys, 
we have demonstrated (22) that 
standard Others, i.e., fifth-grade boys 
and girls presented to Judges by 
means of sound films, did serve as 
discriminable social stimuli. Specifi-
cally, a favorability key showed that
the Judges made consistently and 
significantly more favorable predic- 
tions for some of the children than 
for others. The corrected split-half 
reliabilities of the favorability-of- 
prediction scores ranged from .81 to 
.90, with a median of .86. Now the 
accuracy of a Judge in predicting the 
responses of a particular child de- 
pends in part on the congruence be- 
tween the favorability of his percep- 
tion of the child and the favorability 
of the child’s self-description. Thus, 
the rank order in magnitude of four 
median correlations between favora- 
bility and accuracy was exactly the 
same as the rank order in favorability
of the four children's self-descrip-
tions.

It is instructive to examine the
structure of the relationship between
the favorability of a single Judge's
predictions and of the child's self-
description. We can form a triad
from (a) the prediction protocol of
the Judge, (b) the self-description
protocol of the Other, and (c) the
favorability key. Matching each of
these against the other yields three
scores: (ab) accuracy, (ac) favora-
bility-of-prediction, and (bc) favora-
bility-of-self-description. When any
two of these scores or proportions are
in any degree fixed, the third is par-
tially determined. Thus, suppose
that, from observing the Other, the
Judge forms an over-all impression
such that he will predict favorably
concerning the Other on 75 per cent
of a set of two-choice items. Further,
suppose that the Other describes
himself favorably on 75 per cent of
the items. In this case, the “chance”
level of accuracy with two-choice
items will no longer be 50 per cent;
rather, if no determinants of accuracy
were operating other than the favora-
bility “sets,” chance success on a set
of favorability-loaded items would be
62.5 per cent. Any key may be sub-
stituted for the favorability key
without altering the logic of the tri-
adic relationship.
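
The figure of 62.5 per cent can be checked by a short calculation. The sketch assumes that, item by item, the Judge's favorable predictions and the Other's favorable self-descriptions are statistically independent, so that agreement occurs when both are favorable or both are unfavorable. It also shows the limits within which accuracy must fall once the other two scores of the triad are fixed. The function names are hypothetical.

```python
def chance_accuracy(p_pred_favorable, p_self_favorable):
    """Expected agreement on two-choice items under independence."""
    both_favorable = p_pred_favorable * p_self_favorable
    both_unfavorable = (1 - p_pred_favorable) * (1 - p_self_favorable)
    return both_favorable + both_unfavorable


def accuracy_bounds(p_pred_favorable, p_self_favorable):
    """Lowest and highest accuracy possible given the two favorability scores."""
    lowest = abs(p_pred_favorable + p_self_favorable - 1)
    highest = 1 - abs(p_pred_favorable - p_self_favorable)
    return lowest, highest


expected = chance_accuracy(0.75, 0.75)   # 0.5625 + 0.0625
limits = accuracy_bounds(0.75, 0.75)
```

Under these assumptions the expected agreement is .625, and accuracy cannot fall below .50 whatever the Judge perceives, which is the sense in which fixing two scores of the triad partially determines the third.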

In some of our illustrations, as noted above, the relationship between
the "intermediary key" and the prediction and self-description
protocols may be entirely or partially determined by actual social
perception of the Other by the Judge. The two scores yielded by the
modal prediction key, for example, are both at least partially
determined by genuine social interaction between the Judges and the
Other.

But other keys may yield two scores of which one may be fixed before
the Judges see the Other. The favorability key discussed earlier
illustrates such a possibility. That is, the favorability of the
Other's self-description is determined before the Judge observes the
Other. When the intermediary key is the Judge's prior
self-description, we have another situation where one of the scores
(real similarity) is not influenced by the Judge's interaction with
this Other.
The Judge's assumed similarity to the Other may also be partially
determined by factors extrinsic to the interaction between Judge and
Other. It has been reported (20) that the tendency to assume
similarity may be general over Others and over varying item content.
Only to the extent [...] the predictions may be influenced by
Judge-Other interaction. [...] it were a response disposition general
over all [...] and all Others, the scores [...] psychological [...]

[...] they do not [...] interaction [...] Other [...] artifact. The
[...] of the [...] Other to choose a [...] alternative in a two-choice
[...] when in doubt as to the [...] may yield a triad of this
quasi-mathematical form. Here, of course, the intermediary key
consists of all [...] responses. It is hard to imagine how such
tendencies might be related to social interaction between Judge and
Other, but it is readily apparent that they would influence the
accuracy of the Judges' predictions.

How does this discussion of intermediary keys bear on the problem of
generality of accuracy in perceiving standard persons? We feel that
the finding of reliability of accuracy for one Other without
generality over Others can be explained by the fortuitous occurrence
of these triad relations. Our methods of measuring [...] make the
operation of [...] judgments [...] and typicality, and [...]
similarity [...] are reliable over items [...] Other. [...] a
conclusive [...] generality of the [...] By our selection [...]
responses [...] the Others [...] we make it inevitable [...] the
combination of the Judges' and Others' attributes will [...] for each
Other. Thus we prejudice the results against a spurious [...]
generality due to Judge-[...] relatively little [...] ability on the
part of the Judges to make genuine differentiations.


Crow (7) has demonstrated that generality over Others of the Judges'
response sets can account for the generality over Others of
predictive accuracy. That is, in Crow's data, accuracy seemed to be
general over Others to the degree that (a) those Others were
homogeneous and (b) the Judges had response sets that were general
over the Others.

THE INTERMEDIARY KEY IN
RELATION TO ANALYSIS OF 
VARIANCE 


The intermediary key explicitly generalizes many methods of analyzing
interpersonal perception data that have already been used. One other
major approach to analyzing interpersonal perception has been
developed which has not, at first glance, involved such keys. This is
the analysis of variance design developed by Cronbach (5).

The relationship between the intermediary key and analysis of
variance approach can be seen by referring to the k-space model
developed by Osgood and Suci (18) and by Cronbach and Gleser (6). In
this model, the Judge's predictions and the Other's self-descriptions
are each represented as a point in a k-space, where k is the number of
items or scales. Accuracy is then measured inversely as the distance,
D, between the two points. D equals the square root of the sum over
items or scales of the squares of the differences between predictions
and self-descriptions. In the analysis of variance approach, accuracy
is dissected by showing that it consists of differences between
Judges and Others in response set, in scatter over the items, in
scatter over Others, and so on.
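
The distance measure D described above can be sketched in a few lines. This is a minimal illustration, assuming that predictions and self-descriptions are numeric responses to the same k items in the same order; the function name `profile_distance` is hypothetical.

```python
import math


def profile_distance(predictions, self_descriptions):
    """Distance D between a Judge's predictions and an Other's self-descriptions.

    Each argument is a sequence of k numeric item responses; a smaller D
    means higher accuracy, since accuracy is measured inversely as D.
    """
    if len(predictions) != len(self_descriptions):
        raise ValueError("both protocols must cover the same k items")
    squared_differences = (
        (p - s) ** 2 for p, s in zip(predictions, self_descriptions)
    )
    return math.sqrt(sum(squared_differences))


# Identical protocols give D = 0, i.e., perfect accuracy.
print(profile_distance([3, 1, 4], [3, 1, 4]))  # 0.0
```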


Each of the arithmetic means in- 
troduced as a reference point from 
which to compute a component of 
accuracy (analogous to a component 
of variance) may be considered an 
intermediary key. For example, the 
mean self-description of all Others 
on all items, from which the response 
set component of accuracy is com-
puted, may be considered an inter-
mediary key interposed between pre-
dictions and self-descriptions. In
terms of Cronbach’s analysis, this
key could be used to score the aver-
age of the Judge’s predictions over all
items and all Others, yielding what is 
then called the elevation component 
of accuracy. As a second example, 
the differences between the mean 
self-description of one Other on all 
items and the mean self-description 
of all Others on all items can be used 
as an intermediary key, this time 
applicable to the predictions for a 
particular Other. Applied to the cor- 
responding differences between the 
Judge's average prediction for all 
Others and for the particular Other,
this key would yield measures of
what Cronbach called differential 
elevation. A final example is a key 
consisting of the mean response of all 
Others on each item; applied to the 
mean prediction of a Judge for all 
Others on each item, this would yield 
a measure of the Judge's stereotype 
accuracy. 
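
The three reference points named in this paragraph can be sketched as follows. This is an illustration only, assuming numeric responses arranged as one row per Other and one column per item; the function name `reference_points` and the dictionary labels are hypothetical. How the discrepancy between key and predictions is then summarized (a difference, a squared difference, or a correlation) is left open here.

```python
from statistics import mean


def reference_points(responses):
    """Compute three means from a table responses[other][item].

    Applied to the Others' self-descriptions, these means serve as
    intermediary keys; applied to a Judge's predictions, they give the
    quantities that each key would score.
    """
    grand_mean = mean([x for row in responses for x in row])
    other_means = [mean(row) for row in responses]
    item_means = [mean(column) for column in zip(*responses)]
    return {
        # mean over all Others and all items
        "elevation": grand_mean,
        # each Other's mean over items, as a deviation from the grand mean
        "differential_elevation": [m - grand_mean for m in other_means],
        # mean over all Others on each item
        "stereotype": item_means,
    }


self_descriptions = [[1, 2, 3], [3, 4, 5]]  # two Others, three items
predictions = [[2, 2, 2], [4, 4, 4]]        # one Judge's predictions

keys = reference_points(self_descriptions)
judged = reference_points(predictions)
print(keys["elevation"], judged["elevation"])
```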

Each intermediary key represents a reference point around which a
component of accuracy variance could be analyzed. Not only means over
items, over Others, over Judges, or over any two of these may be used
to dissect accuracy. Rather, the intermediary key formulation shows
that psychologically or logically defined reference points can be
used; for example, a reference point may be constructed consisting of
all "favorable" or all "adjusted" responses. Accuracy can then be
investigated as a function of the concomitance between predictions
and self-descriptions along the "favorability" and "adjustment"
dimensions. Which of the many possibilities the analyst of variance
or the user of intermediary keys chooses will depend upon his
purposes and insight.


SUMMARY 
The intermediary key consists of a protocol, e.g., a set of responses
to a questionnaire or rating scale. When interposed between the
Judge's predictions and the Other's self-descriptions, the
intermediary key sheds light on the processes involved in
interpersonal perception. Eight illustrative intermediary keys, drawn
from recent investigations, are described. Among the considerations
that may be used in evaluating intermediary keys are the internal
consistency of their scores, their degree of confounding with other
keys, and the degree to which the processes revealed by such keys may
be attributed to social interaction between Judge and Other, as
against autistic response sets or mathematical artifacts.

Intermediary keys have been useful in showing how accuracy in
predicting Others' responses can be general over items but not over
Others. Similarly, they have revealed how the processes affecting
accuracy in interpersonal perception may or may not be attributable
to genuine perception of the Other by the Judge. The intermediary key
approach is related to the analysis of variance approach in analyzing
interpersonal perception because the reference points in both
approaches may be considered points in a k-space defined by the k
items or scales of a questionnaire or rating scale. The intermediary
key can be used to define psychologically as well as mathematically
meaningful reference points.